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(57) Abstract 

The present invention relates to the design of trimeric polypeptides using polypeptide structural elements derived from the teU-anectin 
protein family, and their use in rational de novo design and production of multi-functional molecules including the application of the 
multi-functional molecules in protein library technology, such as phage display technology, diagnostic and therapeutic systems, such as 
human gene therapy and imaging. The trimeric polypeptides being constructed as a monomer polypeptide construct comprising at least one 
tetranectin trlmerising structural element (TTSE) which is covalently linked to at least one heterologous moiety, said TTSE being capable 
of forming a stable complex with two other TTSEs; or as an oligomer which is comprised of two monomer polypeptide consUiicts as 
mentioned above, and which comproses three TTSEs or a multiplum of three TTSEs. or which is comprised of three monomer polypeptide 
consuncts. 
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TRIMERISING MODULE 

The present invention relates to the design of trimeric 
polypeptides using polypeptide structural elements derived 
from the tetranectin protein family, and their use in 
5 rational de novo design and production of multi-functional 
molecules includng the application of the multi-functional 
molecules in protein library technology, such as phage 
display technology, diagnostic and therapeutic systems, such 
as human gene therapy and imaging. 

10 BACKGROUND OF THE INVENTION . 

Tetranectin is a Ca^^ -binding trimeric C-type lectin which is 
present in blood plasma and from the extracellular matrix of 
certain tissues. The tetranectin group of proteins comprises 
tetranectin isolated from man and from mouse and the highly 
15 related C-type lectin homologues isolated from the cartilage 
of cattle (Neame and Boynton, database accession number 
PATCHX:u22298) and from reef shark (Neame et al., 1992, Neame 
et al., 1996 and database accession number p26258 and 
PIR2 :A37289) . 

20 The mature tetranectin polypeptide chain of 181 amino acid 
residues is encoded in three exons as shown by molecular 
cloning and characterisation of the gene (Berglund & Peter- 
sen, 1992; Wewer & Albrechtsen, 1992) . Exon 3 of the human 
tetranectin gene encodes a separate functional and structural 

25 unit, a single long-form so-called carbohydrate recognition 
domain (CRD) , with three intra-chain disulphide bridges. The 
tetranectin CRD is considered to belong to a distinct class 
of C-type lectins (Day, 1994) clearly related to C-type 
lectins by sequence homology, conservation of disulphide 

30 topology (Fuhlendorff et al, 1987) and by the presence of an 
almost conserved suit of amino acid residues known to be 
involved in binding of calcium ions . 
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A published poster (Holtet et al 1996) has proposed tetranec- 
tin to be a trimer and that trimerisation is governed by the 
peptide encoded by exon 1 . The peptide encoded by exon 1 was 
proposed to be "necessary and sufficient to govern trimerisa- 
5 tion" whereas the polypeptide encoded by exon 2 was proposed 
as being "involved in lysine-sensitive binding to 
plasminogen" . 

Tetranectin was first identified as a plasma protein binding 
to plasminogen by binding to the kringle-4 domain of 
10 plasminogen. Recent unpublished results (Graversen et al . , 
manuscript for PNAS) proves (1) that the site in tetranectin 
involved in binding to plasminogen resides entirely in the 
CRD-domain (encoded by exon 3), (2) that binding is calcium 
sensitive, and (3) that the kringle-4 binding site in 
15 tetranectin overlaps the putative carbohydrate binding site 
of the CRD domain. Hence, there is now surprising definitive 
evidence that TN exons 1 and 2, i.e. the trimerisation unit 
in TN does not exihibit any plasminogen-binding affinity. 
Accordingly, an artificial protein containing a TTSE unit as 
20 part of its architecture is no£ expected to interact with 
plasminogen or plasmin due to properties inherited from 
tetranectin. 

Tetranectin has also been reported to bind to sulfated 
polysaccharides like heparin (Clemmensen (1989) Scand J. 
25 Clin. Lab. Invest, vol 49:719-725) . We have new results 

showing that the CRD domains of tetranectin are not involved 
in this protein-polysaccharide interaction. In fact, the site 
in tetranectin is located in the N-terminal region of exon l 
and may be abolished by removal or mutagenis of N-termmal 
30 lysine residues (Graversen et al ., manuscript) . processes 

that do not inhibit trimerisation. TTSEs that include most or 
all of TN exon 1 therefore confer an affinity for sulfated 
polysaccharides to any designed protein which encompasses 
such a TTSE as part of its structure. If desired, however, 
35 this affinity can be reduced or abolished by N-terminal 

truncation or mutagenesis of lysine residues in the part of 
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the TTSE that corresponds to the N- terminal 8-10 amino acid 
residues of axon i (Graversenet al. unpublished) . 
With respect to gene therapy which is also withing the scope 
of the present invention, there is only a limited number of 
5 basic strategies for gene therapy which show some promise in 
preclinical models so far. The two major strategies e.g for 
the treatment of malignant tumors are cytokine -gene aided 
tumor vaccination and selective prodrug activation. Whereas 
the first strategy relies on the strong immunostimulatory 

10 effect of a relatively small number of genetically modified 
cytotoxic T cells or tumor cells, the second one is based on 
conversion of a nontoxic prodrug into a toxic product by an 
enzyme-encoding gene where the toxic effect is exerted also 
on non-transduced dividing tumor cells due to a so-called 

15 bystander effect. Alternatively, strategies can be envisaged 
where the malignant phenotype of a cell is reversed by either 
inactivating an oncogene or reestablishing an inactivated 
tumor suppressor gene. In both cases, highly efficient gene 
transfer to the cells in a tumor is required. Although high 

20 efficiencies of gene transfer can be obtained in vitro and 
even in vivo under certain circumstances, correction of the 
malignant phenotype by reversing the major oncogenic change 
in the tumor cells is unlikely to result in normal cells. 
Thus, selective induction of tumor cell death by use of the 

2 5 present invention would be preferable, and the development of 

methods enabling such induction will be of great importance. 

A major problem in connection with the gene therapy is the 
incorporation of foreign material into the genome. Viruses, 
however, have only been partially successful in overcoming 

3 0 this problem. Hence the initial efforts at gene therapy are 

still directed towards engineering viruses so that they could 
be used as vectors to carry therapeutic genes into patients. 
In the still very immature in vivo method of somatic gene 
therapy, where a vector could be injected directly into the 
3 5 bloodstream, or more preferably by transmucosal delivery, the 
present invention may be utilized due to the surprising 
number of ways the gene . therapy may be targeted. 
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For many gene-therapy applications in the future, it is 
probable that a synthetic hybrid system will be used that 
incorporates engineered viral component for target-specific 
binding and core entry, immunosuppressive genes from various 
5 viruses and some mechanism that allows site-specific 

integration, perhaps utilizing AAV sequences or an engineered 
retroviral integrase protein. In addition, regulatory 
sequences from the target cell itself will be utilized to " 
allow physiological control of expression of the inserted 
10 genes. All these components would be assembled in vitro in a 
liposome-like formulation with additional measures taken to 
reduce immunogenicity such as concealment by PEG 

As mentioned, one of the current problems in gene therapy is 
the efficient delivery of nucleic acids to as many as 

15 possible of a specific population of cells in the body, and 
it is often not possible to find e.g. an appropriate viral 
vector that will find that particular cell population 
efficiently and selectively (Review on aspects of gene 
therapy: Schaper, W & I to, W.D. Current Opinion in 

20 Biotechnology, 1996, vol. 7, 635-640. Nature Biotechnology 
1998 vol 16 is an entire volume dedicated to protein- and 
gene delivery) . 

Given the possibility of in vitro generation of a human 
antibody against virtually any target antigen by phage 
25 technology, it follows that TTSEs, where one of the subunits 
is modified with a membrane integrating or associating 
entity, may be used as a practicable tool for generating a 
viral, bacterial or preferentially artificially assembled 
lipomal vehicle that will allow selective -delivery of the 

30 contained material by" infection or trans feet ion of any cell 
population to which such a specific antibody may be 
generated. Moreover vehicles may, with the use of TTSEs, be 
individualised by selection of patient specific antibodies or 
by assembling TTSE units conjugated with scFvs selected from 

35 an ensemble of antibodies selected by the particular markers 
of the disease. 
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SUMMARY OF THE INVENTION 

It has surprisingly been found by the present inventors that 
the human tetranectin polypeptide (and derivatives thereof) 
is capable of forming very stable trimers which have a number 
5 of advantageous characteristics and uses. Notably, the tetra- 
nectin molecule includes a trimerising structural element - 
which can be used as carrier of other chemical entities, 
thereby providing a carrier molecule of a hitherto unseen 
versatility. 

10 Prior published knowledge in the field of providing trimeri- 
sing polypeptides of choice includes the disclosure in WO 
95/31540 by Hoppe and Reid which describes a trimerisation 
module derived from collectin coiled coil structures and its 
application in engineering of artificially trimerised pro- 

15 teins. Several interesting areas of application are common to 
that patent publication and to. the present disclosure. How- 
ever, in several ways the properties of the trimerisation 
modules derived from the tetranectin protein family as dis- 
closed herein are markedly different in fundamental architec- 

20 ture and represent surprisingly improved properties in com- 
parison with the collectin trimerisation unit: 

(1) Although the spatial structures of both trimerisation 
modules at a superficial level appear as similar in that both 
are ternary coiled coil structures of roughly equivalent 

25 spatial size the structural basis for adopting this spatial 
configuration is markedly distinct between the two groups of 
proteins. In fact, it is so distinct that the common belief 
prior to the work of Holtet at al . on cross-linking of human 
tetranectin (Holtet et al . , 1996) was that this family of 

30 proteins were tetrameric (hence the name) . Accordingly, the 
sequences of the tetranectin family of trimerisation modules 
does not conform to the declared common motif delineated for 
the collectin family (WO 95/31540, page 8). 
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(2) The thermal stability of the tetranectin trimerisation 
module (as shown in the examples) is such that the trimer can 
be shown to exist even at about 60^C {Example 4 , trimerised 
tetranectin) or at about 70^C (Example 3 , trimerised ubiqui- 

5 tin) , whereas a collectin trimer unit falls apart at about 
50-55<^C (WO 95/31540, Example 1, page 36 therein) . 

(3) Whereas it remains uncertain whether the collectin trime- 
risation domain possibly allows attachment of fusion partners 
at C- terminal ends of the trimerisation module, and whereas 

10 no example has been reported of successful or claimed suc- 
cessful attachment of a foreign protein (except for the GST 
fusion partner) to the N-terminal region of the collectin 
trimerisation module , the information disclosed herein demon- 
strates that the tetranectin trimerisation module is more 

15 versatile in that is allows attachment of foreign proteins to 
either, as well as to both, terminus or termini simultaneous- 
ly (Examples 1-4) . This has important consequences as the 
tetranectin trimerisation module may be deployed to construct 
molecules that are; able to interact (each end with a binding 

20 valency up to 3) simultaneously with two bulky interaction 
partners like e.g. cell surfaces . 

(4) The virtual absence of subunit exchange between monomers 
of a trimer that has been trimerised using the tetranectin 
trimerisation modules disclosed herein is by first principles 

25 of theinnodynamics correlated with the surprisingly high 

thermal stability of the complex. It will hence be apparent 
that the advantages inherent to the "pick-and-mix" applica- 
tions of the technology, as disclosed herein, may be used to 
much greater advantage because of the much longer shelf life 

3 0 expected for the heterofunctional products of the present 
invention. 

The polypeptide constructs CI IH6PXTN123 , H6FXTN123, H6PXTN12, 
and H6FXTN23 which all involve parts of the tetranectin 
molecule have previously been prepared (cf . e.g, WO 94/18227) 
35 but these constructs have all been provided with a view to 
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facilitating expression and/or purification of the tetranec- 
tin derived moiety of the constructs. To the best of the 
inventors' knowledge no publications exist which reports any 
use of tetranectin derivatives as "building blocks" to which 
5 other chemical moieties advantageously could be coupled. 

Hence, in its broadest aspect the present invention relates 
to a monomer polypeptide construct comprising at least one- 
tetranectin trimerising structural element (hereinafter 
designated a TTSE) which is covalently linked to at least one 

10 heterologous moiety, said TTSE being capable of forming a 
stable complex with two other TTSEs, with the proviso that 
the heterologous moiety is different from any of the fusion 
proteins CIIH6FXTN123 ^ H6FXTN123, H6FXTN12, H6FXTN23, the 
sequences of which are shown in SEQ ID NOs : 24-27. It is 

1 5 preferred that the heterologous moiety is one which does not 
exclusively facilitate expression and/or purification of the 
monomer polypeptide construct . 

The invention further relates to oligomer ic molecules com- 

20 prised of at least two of such monomer polypeptide con- 
structs, and the invention also relates to methods of prepa- 
ring the monomer polypeptide constructs and the oligomers. 
The invention further relates to a kit comprising monomer 
polypeptide constructs in separate packages, ready for use in 

25 a "pick-and-mix" approach for use of the monomers. This pick 
and mix approach is for use in therapeutic as well as 'for 
dianostic use. Eg. having a tumor with a known and specific 
epitope to which an antibody is avaible. The kit may then 
comprise a first TTSE conjugated to an relevant antibody, a 

30 second component which may comprise af TTSE coupled to an 

imaging compound. In a second aspect the second component may 
comprise a drug or a prodrug relevant for treating the tumor. 
In a still further aspect a third TTSE component being a 
monotoring compound, e.g. indicative of the progress of the 

35 targeting. 
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As is clear from this illustration the present invention 
relates to a large amount of combinations allowing an 
individul design for a large number of circumstances . 

Finally, the invention also pertains to fragments which 
5 include nucleic acid sequences which encode the monomer 
polypeptide constructs, as well as to vectors and cells 
containing these nucleic acid fragments. 

LEGENDS TO THE FIGURES 

Fig . 1 : Amino acid sequence of the amino terminal region of 

10 tetranectin. 

Amino acid sequence (in one letter code) from El to L51 of 
tetranectin (SEQ ID NO: 7) . Exon 1 comprises residues El to 
D16 and exon 2 residues V17 to V49, respectively. The alpha 
helix extends beyond L51 to K52 which is the C- terminal amino 

15 acid residue in the alpha helix. 

Fig. 2: Alignment of the amino acid sequences of the trimeri- 
sing structural element of the tetranectin protein family. 
Amino acid sequences (one letter code) corresponding to 
residue V17 to K52 comprising exon 2 and the first three 

20 residues of exon 3 of human tetranectin (SEQ ID NO: 7) ; 
murine tetranectin (Sorensen et al . , Gene, 152: 243 -245, 
1995) ; tetranectin homologous protein isolated from reef - 
shark cartilage (Neame and Boynton, 1992,1996) ; and tetranec- 
tin homologous protein isolated from bovine cartilage (Neame 

25 and Boynton, database accession number PATCHX :u22298) . Resi- 
dues at a and d positions in the heptad repeats are listed in 
boldface. The listed consensus sequence of the tetranectin 
protein family trimerising structural element comprise the 
residues present at a and d positions in the heptad repeats 

3 0 shown in the figure in addition to the other conserved resi- 
dues of the region, "hy" denotes an aliphatic hydrophobic 
residue. 
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Fig. 3: Construction of the expression plasmids pTH6PXtripa 
and pTH6FXtripb . 

The amplified DNA fragments tripa and tripb harbouring the 
tetranectin amino acid sequence (SEQ ID NO: 7) from El to T48 
5 and El to K52, respectively, fused in the 5' end to 

nucleotide sequences encoding a EX, cleavage site IQGR (SEQ 
ID NO: 4) and the recognition sites for the restriction 
endonucleases Bglll and Kpnl, were cut with the restriction 
enzymes Bell and Hindlll and ligated into the BamHI and 
10 Hindlll sites of the expression plasmid pT7H6 (Christensen et 
al., 1991) using standard procedures . 

Fig . 4 : Predicted amino acid sequence of the fusion proteins 
HGFXtripa (SEQ ID NO: 28) and HSFXtripb (SEQ ID NO: 29) 
encoded by the expression plasmids pTH6FXtripa and 
15 pTH6FXtripb; respectively. 

Fig. 5: Construction of the expression plasmids pTH6FXTN123 
and PTCIIH6FXTN123 . 

The amplified DNA fragment corresponding to the full length, 
mature tetranectin monomer (SEQ ID NO: 7) from El to V181 

20 fused in the 5' end to nucleotide sequences encoding a EX, 

cleavage site lEGR (SEQ ID NO: 10) was cut with the restric- 
tion enzymes BamHI and Hindlll and ligated into the corre- 
sponding sites of the expression plasmids pT7H6 (Christensen 
et al . / 1991) and pTCIIHS using standard procedures . pTCIIH6 

25 was derived from pT7H6 by siibstitution of the Ndel - Hindlll 
fragment of pT7H6 with the Ndel - Hindlll fragment of pLcII 
(Nagai and Thogersen, 1987) encoding the first 32 residues of 
the lambda oil protein MVRANKRNEALRIESALLNKIAMLGTEKTAEG (SEQ 
ID NO: 11) fused in the 3' end to a nucleotide sequence 

30 encoding the H6 sequence GSHHHHHHGS (SEQ ID NO: 12) .* 

Fig. 6: Predicted amino acid sequence of the fusion proteins 
H6FXTN123 (SEQ ID NO: 25) and CIIH6FXTNi23 (SEQ ID NO: 24) 
encoded by the expression plasmids pTH6FXTN123 and 
pTCIIH6FXTN123, respectively. 
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Fig. 7: Construction of the expression plasmids pTH6FXTN12, 
pTH6FXTN23, and pTH6FXTN3 . 

The amplified DNA fragments corresponding to the tetranectin 
derivatives TN12 and TN3 from El to V49 and A45 to V181, 
5 respectively (SEQ ID NO: 7) fused in the 5' end to nucleotide 
sequences encoding the FX, cleavage site lEGR (SEQ ID NO: lo) 
was cut with the restriction enzymes BainHI and Hindi II and 
ligated into the corresponding sites of the expression plas- 
mids pT7H6 (Christensen et al,, 1991) using standard pro- 

10 cedures. The amplified DNA fragment corresponding to the 

tetranectin derivative TN23 from V17 to V181 (SEQ ID NO: 7) 
fused in the 5 ' end to nucleotide sequences encoding the FX, 
cleavage site IQGR (SEQ ID NO: 4) was cut with the restric- 
tion enzymes BamHI and Hindi I I and ligated into the corre- 

15 spending sites of the expression plasmids pT7H6 (Christensen 
et al . , 1991) using standard procedures . 

Fig. 8: Predicted amino acid sequence of the fusion proteins 
H6FXTN12 (SEQ ID NO: 26), H6FXTN23 (SEQ ID NO: 27), and 
H6FXTN3 (SEQ ID NO: 30) encoded by the expression plasmids 
20 PTH6FXTN12, pTH6FXTN12, respectively. 

Fig. 9: Gel filtration analysis of TN123, TN23, and TN3 
Analytical gel filtration of the recombinant tetranectin 
derivatives TN123, TN23, and TN3 were performed on a Superose 
12 HR 10/30 column (Pharmacia, Sweden) with a total volume of 
25 25 ml in 100 mM NaCl and 5 0 mM Tris-HCl pH 8 and a flow rate 
of 0.2 ml/min. Vertical bars at peak maxima identify elution 
profiles for each of the three proteins. 

Fig. 10: Cross-linking analysis of TN123 and CIIH6FXTN123 . 
Samples of TN123, CIIH6FXTN123 and mixtures of both were 

30 inciibated with DMSI and analyzed by SDS-PAGE (12% gel) . 

Before addition of DMSI, protein mixtures were subjected to 
subunit exchange by incubation at 70^=^0 for varying length of 
time. Protein marker of 94, 68, 43 and 30 kOa, top to bottom 
(laneM) . CIIH6FXTN123 fusion protein (lane l) . TN123 (lane 

35 2) . DMSI treated CIIH6FXTN123 (lanes 3 and 6) . DMSI-treated 
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TN123 (lane 4) . Identical samples of DMSI treated mixtures of 
CIIH6FXTN12 3 and TN12 3 without heat exposure (lanes 5 and 7) 
and heat treated for 2.5 sec, 15 sec, 2.5 min. and 10 min., 
respectively/ before treatment with DMSI (lanes 8-11) . 

5 Fig. 11: Cross-linking analysis of the recombinant tetranec- 
tin derivatives TN123, TN23, TN3 , and H6FXTN12. 
The recombinant proteins TN123, TN23,TN3, H6FXTN12 or mix- 
tures of TN123 and each of the other were analyzed by SDS- 
PAGE. Protein marker of 94, 68, 43, 30, 20, and 14.4 kDa, top 

10 to bottom (laneM) . TN123 cross-linked with DMSI (lane 1) . 
TN123 and H6-rTN12 cross-linked with DMSI without and with 
heat treatment atWO^C for two min. (lanes 2 and 3) . H6FXTN12 
cross-linked with DMSI (lanes 4 and 5) . Mixture of TN123 and 
H6FXTN12, no cross-linking (lane 6). Cross-linking of TN123 

15 and TN23 without and with heat treatment at lO^'C for two min. 
(lanes 7 and 8) . Cross- linking of TN23 (lane 9) . Mixture of 
TN123 and TN23 without cross-linking (lane 10) . TN123 cross- 
linked by DMSI (lane 11) . Cross.-linking of TN123 and TN3 
without and with heat treatment for two min. (lanes 12 and 

20 13) . Cross-linking of TN3 (Lane 14) .Mixture of TN123 and 
TN3, no cross-linking (lane 15) - 

Fig. 12 : Cross-linking based analysis of the trimer thermal 
stability. 

In parallel experiments TN12 3 and the fusion protein 
25 H6FXtripb-UB (SEQIDNO: 31) were cross-linked with DMSI at 
different temperatures and the samples analyzed by SDS-PAGE. 
Protein marker of 94, 68, 43, 30, 20, and 14.4 kOa, top to 
bottom (laneM). TN123 without cross-linking (lane 1) . TN123 
cross-linked with DMSI for 15 min, at 37«C, SO^'C, 60<>C, and 
30 70 °C (lanes 2 to 5) , respectively. The fusion protein 

H6PXtripb-UB (SEQIDNO: 31) without cross-linking (lane 6). 
H6FXtripb-UB cross-linked with DMSI for 15 min. at 37**C, 
50'=^C, 60°C, and 70^C (lanes 7 to 10), respectively and 
H6FXtripb.-UB incubated at lO^C for 15 min. (lane 11) . 



wo 98156906 PCT/DK98/00245 

12 

Fig. 13: Construction of the expression plasmid pTH6PXtripb- 
UB, 

The amplified DNA fragment comprising the nucleotide sequence 
(SEQ ID NO: 16) encoding the ubiquitin amino acid sequence 
5 (SEQ ID NO: 19) from Q2 to G76 was cut with the restriction 
enzymes Bell and Hindlll and ligated into the BamHI and 
Hindlll sites of the expression plasmid pT7H6FXtripb (Example 
1) using standard procedures. 

Fig. 14: Predicted amino acid sequence of the fusion protein 
10 H6FXtripb-UB (SEQ ID NO : 31) encoded by the expression 
plasmid pTH6FXtripb-UB . 

Fig. 15: Construction of the expression plasmid pTH6FXscFV 
(CEA6) tripb. 

The DNA fragment, amplified with the primer pair SEQ ID NOs : 
15 21 and 22, comprising the nucleotide sequence SEQ ID NO: 20 
encoding the single chain antibody CEA6 , scFV (CEA6) , amino 
acid sequence from Ql to A261 was cut with the restriction 
enzymes BamHI and Kpnl and ligated into the Bglll and Kpnl 
sites of the expression plasmid pT7H6FXtripb (Example 1) 
20 using standard procedures . 

Fig. 16: Predicted amino acid sequence of the fusion protein 
H6PXscFV(CEA6) tripb encoded by the expression plasmid 
pH6FXscFV(CEA6) tripb. 

Fig. 17: Construction of the expression plasmid 

25 pTH6FXtripbscFX(CEA6) . 

The DNA fragment, amplified with the primer pairs having SEQ 
ID NO: 21 and 23, comprising the nucleotide sequence (SEQ ID 
NO: 20) encoding the single chain antibody CEA6; scFV (CEA6) , 
amino acid sequence from Ql to A261 was cut with the restric- 

3 0 tion enzymes BamHI and Hindlll and ligated into the BamHI and 
Hindlll sites of the expression plasmid pT7H6PXtripb ^ (Example 
1) using standard procedures. 



wo 98/56906 PCT/DK98/00245 

13 

Fig. 18: Predicted amino acid sequence of the fusion protein 
H6FXtripbscFv(CEA6) encoded by the expression plasmid 
pH6FXtripbscFv(CEA6) . 

Fig. 19: Construction of the expression plasmid . 
5 pTH6PXscFv(CEA6) tripbscFX(CEA6) . 

The DNA fragment, amplified with the primer pair SEQ ID NO: 
21 and 22, comprising the nucleotide sequence (SEQ ID NO: ' 20) 
encoding the single chain antibody CEA6, scFV {CEA6) , amino 
acid sequence from Ql to A261 was cut with the restriction 
10 enzymes BamHI and Hindi II and ligated into the BamHI and 
Hindu I sites of the expression plasmid 

pT7H6FXscFv(CEA6) tripb (Example 4) using standard procedures . 

Fig. 20: Predicted amino acid sequence of the fusion protein 
H6FXscFv{CEA6) tripbscFv(CEA6) (SEQ ID NO : 34) encoded by the 
15 expression plasmid pH6FXscFv (CEA6) tripbscFv (CEA6) . 

Fig. 21: Cross-linking analysis of the H6FXtripbscFv(CEA6) 
fusion protein (SEQ ID NO: 33). 
In parallel experiments the fusion proteins 
H6FXtri*pbscFv(CEA6) (SEQ ID NO: 33) and TN123 were cross- 

20 linked at room temperature for 30 min. with 0 mg/ml, 0.5 

mg/ml, 1.0 mg/ml, 1.5 mg/ml, and 2.0 mg/ml of DMSI, respec- 
tively. Lane 1: H6FXtripbscFv(CEA6) without DMSI, 
. H6FXtripbscFv(CEA6) with 0.5 mg/ml DMSI (lane 2) , 
H6FXtripbscFv(CEA6) with 1 . 0 mg/ml DMSI (lane3), 

25 H6FXtripbscFv(CEA6) with 1.5 mg/ml DMSI (lane 4) and 

H6FXtripbscFv(CEA6) with 2.0 mg/ml DMSI (lane 5) . Protein 
marker of 94, 68, 43, 30, 20, and 14.4 kDa, top to bottom 
(laneM). Lane 6: TN123 without DMSI, TN123 with 0.5 mg/ml 
DMSI (lane?), TN123 with 1 . 0 mg/ml DMSI (lane 8) TN123 with 

30 1.5 mg/ml DMSI (lane 9) and TN123 with 2 . 0 mg/ml DMSI (lane 
10) . 



DETAILED DISCLOSURE OF THE INVENTION 
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The term "trimerising structural element" (TTSE) used in the 
present description and claims is intended to refer to the 
portion of a polypeptide molecule of the tetranectin family 
which is responsible for trimerisation between monomers of 
5 the tetranectin polypeptide. The term is also intended to 

embrace variants of a TTSE of a naturally occurring tetranec- 
tin family member, variants which have been modified in the 
amino acid sequence without adversely affecting, to any 
substantial degree, the trimerisation properties relative to 

10 those of the native tetranectin family member molecule. 
Specific examples of such variants will be described in 
detail herein, but it is generally preferred that the TTSE is 
derived from human tetranectin, murine tetranectin, C-type 
lectin of bovine cartilage, or C-type lectin of shark carti- 

15 lage. Especially preferred is monomer polypeptide constructs 
including at least one TTSE derived from human tetranectin. 

The 49 residue polypeptide sequence encoded by exons l and 2 
of tetranectin (Fig. i) appears to be unique to the tetranec- 
tin group of proteins (Fig. 2) as no significant sequence 

2 0 homology to other known polypeptide sequences has been estab- 
lished. In preparation for experimental investigations of the 
architecture of tetranectin a collection of recombinant 
proteins was produced, the collection including complete 
tetranectin, the CRD domain (approximately corresponding to 

2 5 the polypeptide encoded by axon 3) , a product corresponding 
to the polypeptide encoded by exons 2+3, a product correspon- 
ding to exons 1+2 (Holtet et al., 1996; Example 2) . As 
detailed in Example 2 we now know differently: tetranectin is 
indeed a trimer, but the exon 2 encoded polypeptide is in 

30 fact capable of effecting trimerisation by itself as evi- 
denced by the observation that the recombinant protein corre- 
sponding to exons 2+3 is in fact trimeric in solution. 

BD-structure analysis of crystals of full-length recombinant 
tetranectin (Nielsen et al., 1996; Nielsen, 1996; Larsen et 
35 al., 1996; Kastrup, 1996) has shown that the polypeptide 
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encoded in exon 2 plus three residues encoded in axon 3 form 
a triple alpha helical coiled coil structure. 

From the combination of sequence and structure data it 
becomes clear that trimerisation in tetranectin is in fact 
5 generated by a structural element (Fig. 2), comprising the 
amino acid residues encoded by exon two and the first three 
residues of exon 3 by an unusual heptad repeat sequence, that 
apparently is unique to tetranectin and other members of its 
group: This amino acid sequence (Fig. 2) is characterised by 

10 two copies of heptad repeats (abcdefg) with hydrophobic 
residues at a and d positions as are other alpha helical 
coiled coils. These two heptad repeats are in sequence fol- 
lowed by an unusual third copy of the heptad repeat, where 
glutamine 44 and glutamine 4 7 not only substitute the 

15 hydrophobic residues at both the a and d position, but are 
directly involved in the formation of the triple alpha 
helical coiled coil structure. These heptad repeats are 
additionally flanked by two half-repeats with hydrophobic 
residues at the d and a position, respectively. 

20 The presence of beta-branched hydrophobic residues at a or d 
positions in alpha helical coiled coil are known to influence 
the state of oligomerisation. In the tetranectin structural 
element only one conserved valine (number 37) is present. At 
sequence position 29 in tetranectin no particular aliphatic 

25 residue appears to be preferred. 

In summary, it is apparent that the triple stranded coiled 
coil structure in tetranectin to a large extent is governed 
by interactions that are unexpected in relation to those 
characteristic among the group of known coiled coil proteins. 

30 The TTSEs form surprisingly stable trimeric molecules . 

(Examples 2 , 3 and 4) . The experimental observations, that 
(1) a substantial part of the recombinant proteins exists in 
the oiigomeric state of - and can be cross-linked as - 
trimeric molecules even at' 70 ®C and (2) that exchange of 
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monomers between different trimers can only be detected after 
exposure to elevated temperature are evidence of a extremely 
high stability of the tetranectin trimerising structural 
element. This feature must be reflected in the amino acid 
5 sequence of the structural element. In particular, the pre- 
sence and position of the glutamine containing repeat in the 
sequential array of heptad repeats is, together with the 
presence and relative position of the other conserved resi- 
dues in the consensus sequence (Fig. 2) , considered important 

10 for the formation of these stable trimeric molecules. For 

most practical uses the cysteine residue 50 should be mutage- 
nized to serine, threonine, methionine or to any other amino 
acid residue in order to avoid formation of an unwanted 
inter-chain disulphide bridge, which eventually would lead to 

15 uncontrolled multimerisation, aggregation and precipitation 
of a polypeptide product harbouring this sequence. 

In particular in conjunction with the trimer-stabilising exon 
1 encoded polypeptide (tetranectin residues 1 to 16, see 
Example 2), the tetranectin trimerising structural element is 
2 0 a truly autonomous polypeptide module retaining its struc- 
tural integrity and propensity to generate a highly stable 
homotrimeric complex whether it is attached or not by a 
peptide bond at either or at both termini to other proteins. 
This unique property is demonstrated in the accompanying 

2 5 examples, which provide experimental proof, that polypeptide 

sequences derived from heterologous proteins may readily be 
trimerised when joined as fusion proteins to the tetranectin 
trimerising structural element. This remains valid irrespec- 
tive of whether the heterologous polypeptide sequences are 
30 placed amino- terminally or carboxy- terminally to the trimeri- 
sing element allowing for the formation of one molecular 
assembly containing up to six copies of one particular 
polypeptide sequence or functional entities, or the formation 
of one molecular assembly containing up to six different 

3 5 polypeptide sequences, each contributing their individual 

functional property. 
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Since three TTSEs of naturally occurring human tetranectin 
forms up a triple alpha helical coiled coil, it is preferred 
that the stable complex formed by the TTSEs of the invention 
also forms a triple alpha helical coiled coil. 

5 The "tetranectin family" are polypeptides which share the 
consensus sequence shown in Fig. 2 or a sequence which are 
homologous at sequence level with this consensus sequence. 
Hence, monomer polypeptide constructs of the invention are 
preferred which comprise a polypeptide sequence which has at 
10 least 6 8% sequence identity with the consensus sequence shown 
in Fig. 2, but higher sequence identities are preferred, such 
as at least 75%, at least 81%, at least 87%, and at least 
92%. 



By the term "heterologous moiety" is herein meant any chemi- 

15 cal entity which can be linked covalently to a TTSE and to 
which the TTSE is not natively covalently bound. Hence, the 
heterologous moiety can be any covalent partner moiety known 
in the art for providing desired binding, detection, or 
effector properties. The heterologous moiety can be a ligand 

20 binding structure such as a receptor molecule or the ligand 
binding part of a receptor molecule, an antibody, an antigen 
binding antibody fragment, or a molecule having antibody 
characteristics such as e.g. the "diabodies" described in 
EP-A-0 672 142, or other ligand binding molecules such as 

25 avidin or streptavidin, or a lectin; a toxin such as ricin; a 
detectable label such as a fluorescence labelled molecule, a 
radioactively labelled molecule, an enzymatically labelled 
molecule; an in situ activatable substance, such as a mo- 
lecule which can be induced by a magnetic field or by 

30 radiation to be radioactively or chemically active; an enzyme 
such as a peroxidase; a radioactive moiety such as ay-, a-, 
jS"-, or -emitting molecule, e.g. a molecule comprising one 
or more radioactive isotopes selected from -^^C, ^H, -^^P, ^-^P, 
^^S, ^^S, ^^Cl,^^Na, 2%a, ^°K, ^^K, ^^K, and any isotopes 

35 conventionally utilized for the purposes of facilitating 
detection of probes or the purposes of providing localized 



wo 98156906 PCT/DK98/00245 

18 

radiation so as to effect cell death; a cytokine such as an 
interferon or a leukotriene; PNA; a non-proteinaceous polymer 
such as a polymeric alkaloid, 11. a polyalcohol, a 
polysaccharide, a lipid and a polyamine; a photo cross- 
5 linking moiety, i.e. a chemical entity which effects cross- 
linking upon photo -activation; and a group facilitating 
conjugation of the monomer polypeptide construct to a target. 

The heterologous moiety is preferably covalently linked to 
the TTSE by via a peptide bond to the N- or C- terminus of the 

10 TTSE peptide chain, via a peptide bond to a side chain in the 
TTSE or via a bond to a cysteine residue, but any way of 
coupling covalently heterologous material to a polypeptide 
chain will be useful. The skilled person will know of such 
possibilities, e.g. by consulting the teachings of WO 

15 95/31540 in this regard which are hereby incorporated by 
reference. 

However, one interesting aspect of the invention relates to a 
monomer polypeptide construct of the invention comprising two 
heterologous moieties which are linked via peptide bonds to 

20 the N- and C-terminus, respectively. This approach introduces 
a number of possibilities in terms of e.g. linking larger 
entities with oligomers of the invention by having specific 
activities coupled to each end of the monomers (as explained 
in detail below, the oligomers of the invention may also 

25 utilise a version of this principle, where e.g. one N-termi- 
nus and one C-terminus of an oligomer are linked via peptide 
bonds to independent heterologous moieties) . 

In general, a complex between two or three monomers are 
described in the following way: three monomers having one 

30 TTSE each foinns a trimer designated (l+l + l) , whereas a dimer 
formed between a monomer with two TTSEs and a monomer-with 
one TTSE is designated (1+2) . Other (undesired) t rimers can 
of course be formed, e.g. (2 + 2 + 1) , where two TTSEs are not 
"in use", but it is preferred that the oligomers of the 

35 invention use all of their available TTSEs during conplex 
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formation. It should also be noted that the tenn "monomer 
polypeptide construct" is meant to designate a single 
polypeptide chain which may or may not have non-peptide 
groups coupled covalently to the polypeptide chain, whereas 
5 "dimeric polypeptide" or "dimer" , "trimeric polypeptide" or 
"trimer" and "oligomer" (i.e. a dimer or trimer) in the 
present context are meant to designate non-covalent complexes 
of monomer polypeptide' constructs . I.e., the traditional 
. definitions of monomers and multimers do not apply in the 
10 present specification and claims. 

The TTSE as exemplified by exon 2 or exons i and 2 of human 
tetranectin, preferably so modified to allow only hetero- 
trimerisation between dissimilar (1+1+1) or (1+2) (cf. the 
below discussion) subunits may be deployed as a general 

15 affinity mediator, which can be coupled chemically to each 
member of a selection of target molecules. After such conju- 
gation with TTSE the target molecules may be homo- or hetero- 
trimerised as desired for any application. Similar deployment 
of dimerisation, using as one partner a polypeptide harbour- 

20 ing two TTSE sequences in-line, separated by a linker 

sequence of suitable length and character, may bet yet more 
advantageous, as in such case absolute control of 
stoichiometry in complex formation would be possible. Thus, 
an important' embodiment of the invention is a monomer 

2 5 polypeptide construct of the invention comprising 2 TTSEs 

which are covalently linked by a spacer moiety which allows 
both of the 2 TTSEs to take part in complex formation with a 
third TTSE not being part of the monomer polypeptide con- 
struct, but equally important is the embodiment of the inven- 
30 tion where the monomer polypeptide construct comprises one 
single TTSE, so as to allow trimerisation between three 
.monomers and hence providing the optimum degree of versati- 
lity with respect to the number of functional units which can 
be easily incorporated into one single complex. 

3 5 In the embodiments where two TTSEs are present in the same 

monomer it is preferred that the spacer moiety has a length 
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and a conformation which favours complex formation involving 
both of the two TTSEs which are covalently linked by the 
spacer moiety. In this way, problems arising from xmdesired 
formation of trimers of the formats (2+1+1), (2+2+2), and 
5 (2 + 2 + 1) (wherein only one TTSE of each monomer participates 
in complex formation) can be diminished. Design and prepara- 
tion of suitable spacer moieties are known in the art and are 
conveniently effected by preparing fusion polypeptides having 
the format TTSE^- Space r-TTSE^, where the spacer moiety is a 
10 polypeptide fragment (of ten a relatively inert one), so as to 
avoid undesired reactions between the spacer and the sur- 
roundings or the TTSEs. 

One typical scenario, where such modification may be advan- 
tageous is the case of immunological detection where a chemi- 
15 cal conjugate of an antibody with enzymes such as peroxidase 
is used for in situ staining purposes in tissue or on western 
blots . 

A similar, but yet different, application example would be 
the deployment of TTSE to mediate conjugation of e.g. alka- 

2 0 line phosphatase and an oligonucleotide which would allow in 

situ identification of a given mRNA in a tissue sample 
concurrently with identification of any other mRNA molecule 
e.g. by interconnection of a second appropriate oligonucleo- 
tide and a signalling/visualisation molecule using e.g. the 
25 biotin - avidin/streptavidin affinity pair for conjugation. 
The point of having two or more selective affinity systems 
available for conjugating oligonucleotide probes and detector 
molecules is that as many different sequences may be detected 
simultaneously as there are affinity sets available. 

30 In terms of chemistry required to exploit TTSE as a conjuga- 
ting affinity-contributing agent, the peptide corresponding 
to exon 2 will have a sufficient affinity for most purposes, 
but incorporation of all, or some segment of the exon 1 
polypeptide will serve to increase affinity and stability. 

3 5 The properties of tetranectin mutants in which many of hydro- 
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philic (e.g. lys and glu) residues that are largely exterior 
in the coiled coil structure have been replaced with alanine 
appear similar to the native protein, suggesting that is 
indeed possible without interfering very much with stability 
5 of the trimeric structure to replace all glu, asp and lys 

residues by a combination of gin, asn, arg or ala, and there- 
by generate a sequence that, as an N-terminally blocked 
synthetic peptide, would be very easy to convert into a 
chemically stable active-ester component, e.g. an N-hydroxy 

10 succinimide ester of an acetylated peptide, that could react 
with (and thereby couple to) any exposed lysine side chain in 
a target molecule of interest. Such peptide synthesis, acti- 
vation and coupling chemistry will be readily designed and 
applied by a person skilled in the art of peptide chemistry, 

15 as will indeed any other conjugation chemistry, like the 
attachment and use of photo-activatable moieties like e.g. 
phenyl azides. 

In conclusion, it seems that the most important structure in 
native TTSE is the consensus sequence shown in Fig. 2, and 

20 that large variations in the polypeptide chain may be 

allowed. Hence, one advantageous embodiment of the monomer 
polypeptide construct of the invention is one where at least 
one amino acid residue selected from the group consisting of 
amino acid residue nos . 6, 21, 22, 24, 25, 27, 28, 31, 32, 

25 35, 39, 41, 42, is/are substituted by any. non-helix breaking 
amino acid residue, the amino acid residue, numbering refer- 
ring to amino acid residues in SEQ ID NO: 7. All these resi- 
dues have been shown not to be directly involved in the 
intermolecular interactions which stabilises the trimeric 

3 0 complex between three TTSEs of native tetranectin monomers 
and it is therefore expected that these amino acids may be 
safely substituted with any amino acid which will not have an 
adverse effect on helix formation (notably proline, which 
introduces a rigid bend in a polypeptide chain) . 



35 Another advantageous embodiment of the" monomer polypeptide 

construct of the invention is one which is free from any free 
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amino and/or carboxy groups. This would favour synthesis of a 
TTSE by means of solid or liquid phase peptide synthesis, 
since there would be no need of introducing any protective 
groups during peptide synthesis. 

5 Since the consensus sequence of Fig. 2 is believed important 
and since this consensus sequence embraces the above-dis- 
cussed heptad repeat, it is according to the invention pre- 
ferred that the TTSE comprises a repeated heptad having the 
formula a-b-c-d-e-f -g (N to C) , wherein residues a and d 

10 generally are hydrophobic amino acids. However, since "a" and 
"d" in the third of the complete heptads of all known members 
of the tetranectin family are constituted of glutamine, it is 
most preferred that the TTSE comprises the heptad repeated 3 
times and that the last occurrence of the heptad has a 

15 glutamine residue corresponding to residues a and d. 

Since exon 2 of native members of the tetranectin family 
seems to contain the necessary elements to effect stable 
trimerisation, it is preferred that the monomer polypeptide 
construct is free of substantial parts of tetranectin which 

20 is encoded by exon 3 and/or lacks substantial parts of tetra- 
nectin which is encoded by exon l. However, since exon 1 
encoded material seems to stabilise the trimeric native 
tetranectin, it is especially preferred that all or part of 
exon 1 is part of the monomer polypeptide construct, and it 

25 also seems to be rational to include the first three amino 
acids encoded by exon 3 , since these are known to take part 
of the formation of the native triple alpha helical coiled 
coil in human tetranectin. 

One particularly interesting embodiment of the invention is 
30 the possibility of designing oriented molecular assemblies, 
where one or more functional entities are located N-tejcmin- 
ally to the trimerising element and one or more fimctional 
entities are located C-terminally to the element. Such types 
of design may be particularly advantageous where a certain 
3 5 relative ratio is desired among the different functional 
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entities included in a specific molecular unit. Such type of 
design may in addition be used if one or more functional 
entities for either structural or functional reasons appear 
incompatible within the same construct. Such may be the case 
5 if one or more of the functional entities are expressed by 
large or bulky protein domains which for steric reasons might 
prevent formation of the trimeric molecular unit due to 
sterical constraints . 

The possibility of constructing bi-polar three-way fusion 
10 proteins in which one functionality is placed N-terminally to 
the coiled coil structure and a different functionality is 
placed C-terminally is additionally advantageous in applica- 
tions where large spatial separation between the two 
functionalities are desirable for optimal function. Examples 
15 of such application are e.g. the deployment of binding 

domains (e.g. antibody-derived binding modules) for recog- 
nition and binding to binding sites located at or close to 
large structures like cell membranes in cases where it is 
advantageous to allow for binding of the other end of the 

2 0 trimerised molecule to a different, but also bulky target. 

Hence, as discussed above, the oligomers of the invention may 
be used to join e.g. bulky surfaces by the oligomer according 
to the invention comprising at least one heterologous moiety 
which is positioned N-terminally to a TTSE and at least one 
25 heterologous moiety which is positioned C-terminally to a 

TTSE. The two heterologous moieties can be either part of the 
same monomer polypeptide construct or parts of two separate 
monomer polypeptide constructs . 

The extraordinarily high stability of any trimer containing 

3 0 the tetranectin trimerisation module under physiological 

buffer and temperature condition (i.e. absence of denaturant, 
temperature not exceeding 40 °C) in combination with the ease 
by which exchange of monomer subunits between trimers can be 
effected by incubation at moderately elevated temperature or 
3 5 in the presence of denaturants provide for a .unique opportu- 
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nity to deploy the trimerisation module as a vehicle to allow 
the construction of "pick-and-mix" conjugates prepared from 
previously fabricated collections of homotrimeric molecules. 
To illustrate the versatility of this design opportunity by 
5 way of theoretical example, let us assume that (1) a collec- 
tion of twenty different antibody constructs (e.g. in the 
format of single-chain Fv) each of its own characteristic 
binding specificity, has been selected and then turned into 
homo- 1 rimer ic molecules by fusion to a tetranectin trimerisa- 

10 tion module, and let us also assume that a set of twenty 

different effector molecules (e.g. toxin domains) have simi- 
larly been prepared and also conjugated, to the tetranectin 
trimerisation module . A user provided with prefabricated 
collections of twenty different antibody constructs and 

15 twenty different toxin constructs - 4 0 different reagents in 
all - has the opportunity then to prepare 400 different 
toxin-antibody conjugates, simply by mixing a first preferred 
component from one reagent collection with a second preferred 
reagent from the other collection and then subject this 

2 0 binary mixture to conditions, i.e. gentle heating or incuba- 
tion with a suitable level of denaturant, to accomplish 
subunit exchange among all trimeric molecular species in the 
mixture. After the subunit exchange step the desired hetero- 
bifunctional reagent will be present in the mixture as a 

2 5 major component of the mixture and may then be deployed as 

such to accomplish a given purpose or, alternatively apply a 
simple purification step to isolate his favoured hetero- 
functional binary reagent from any remaining mono- functional 
trimer species by a simple standard protein purification 

3 0 step, easily designed using standard techniques known in the 

field of protein purification. 

A further enhancement of the versatility of the "pick-and- 
mix" technology may be achieved by including a specific 
affinity purification tag on each array of trimerisation 
35 module - probe/effector/indicator conjugate, fused directly 
in-line or, alternatively, fused via a cleavable linker (a 
polypeptide segment containing e.g. a factor or an 
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enterokinase recognition/cleavage site) to the affinity tag. 
More specifically, if each of three libraries were tagged 
with affinity handles a, b and c, respectively, that were 
recognised by binding substances A, B and C, respectively, 
5 pure heterotrimers, composed of one member of each library, 
could be obtained in a three-step affinity purification 
procedure designed to allow selective recovery of only such 
t rimers that exhibit affinity for substances A and b and c. 
If, for any reason, subsequent removal of the affinity tags 

10 were desirable, and the const rxicts had been prepared to 

include cleavable linkers, isolation of the pure heterotri- 
mer, liberated from all- affinity tags, could be accomplished 
by three further affinity purification steps, arranged to 
isolate only material that would bind to neither substance A 

15 nor siibstance B nor substance C. 

Obviously, the scope of "pick-and-mix" design of user-prepar- 
able he tero functional complexes apply not only to the forma- 
tion of binary hetero- functionality, but would apply by logic 
extension to the formation of ternary hetero-functionality : 

20 To envisage the wealth of possibilities that are inherent to 
the concept of ternary hetero-functionality in a further 
theoretical example along the lines given above, three sets 
of reagent collections, each comprising 20 different func- 
tional characteristics, i.e. a collection of in toto 60 

25 different homotrimers would allow "pick-and-mix" preparation 
of 8, 0 00 different tri-functional molecules . 

The basic tetranectin trimerisation module will, essentially 
indiscriminately, form homo- and hetero-trimers with any 
molecule that also contains a trimerisation module . For some 

30 applications it may be advantageous to have available 

specially engineered derivatives of the tetranectin trimeri- 
sation module, which have been reengineered to disallow homo- 
trimer formation and hence only allow hetero-trimerisation. 
Thus, an important embodiment of the monomer polypeptide 

35 construct of the invention is constructed/reengineered so as 
to disfavour formation of complexes between identical TTSEs/ 
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this also has the implication that oligomers of the invention 
can advantageously be comprised of monomer polypeptide con- 
structs which are designed so as to disfavour formation of 
trimers including two monomer polypeptide constructs having 
5 identical TTSEs . One way of disfavouring the formation of 
homotremerisation would be by "knobs into holes" mutagenisis. 



The design/reengineering may be accomplished by introduction 
of amino acid substitution at sites in the polypeptide inti- 
. mately involved in the formation and stability of the trimer 

10 and, simultaneously, in a different construct introduce a 
compensatory amino acid substitution, all in all removing 
symmetry between individual monomer components of the triple 
helical structure so that the structural complementarity 
profile only allows the formation of hetero-trimers , but is 

1 5 incompatible with some or each of the homotrimer species . 

A yet different way to deploy the tetranectin trimerisation 
module as a vehicle to accomplish rational formation of 
bifunctionalisation would require the interconnection of the 
C-terminus of one monomer to the N-terminus of a second 
20 monomer in the triple-helical structure. The basic require- 
ment for such an intervening polypeptide is, that allowed 
spatial distances between its N- and C-termini must be com- 
patible with the spacing inherent to the structural require- 
ments given by the architecture of the tetranectin t rimer isa- 
25 tion module. The construction of an intervening connecting 
polypeptide allowed according to such criteria would be 
readily accomplished by an average person skilled in the art 
of protein engineering, as an ample collection of examples of 
the deployment of, usually flexible, spacer sequences are 
30 known both in nature and in designed proteins. Due to the 
expected entropic contribution to interaction energy in a 
molecule in which two of the three tetranectin trimerisation 
module components are covalently tied together, such a mo- 
lecule would show great preference for selecting any molecule 
35 containing only a single copy of the tetranectin trimerisa- 
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tion module component, as this selection would be energeti- 
cally favoured. Hence, conjugation of one functional protein 
component to a suitably selected covalently dimerised tetra- 
nectin trimerisation module component and conjugation of a 
5 different functional protein component to a single-copy 

element of the trimerisation sequence would provide for the 
preferential formation of a 1:1 bifunctional cortplex and 
suppression of formation of any other complex. 

The monomers of the invention may be prepared by methods 
10 generally known in the art, using exclusively or in combina- 
tion the techniques of recombinant protein production, 
peptide synthesis (liquid phase or solid phase) , and tradi- 
tional chemical coupling of heterologous moieties to a 
peptide chain or to specific residues therein. Hence the 

1 5 invention also relates to a method of preparing the monomer 

polypeptide construct of the invention, the method comprising 

isolating the monomer polypeptide construct from a cul- 
ture comprising a host cell which carries and expresses a 
nucleic acid fragment which encodes the monomer 
20 polypeptide construct, 

synthesizing, by means of chemical peptide synthesis, the 
tnonomer polypeptide construct and subsequently isolating 
the monomer polypeptide construct from the reaction 
mixture , 

2 5 - preparing a TTSE in a culture comprising a host cell 

which carries and expresses a nucleic acid fragment which 
encodes the TTSE, subsequently linking covalently at 
least one heterologous moiety to the TTSE, and thereafter 
isolating the resulting monomer polypeptide construct, or 



30 



synthesizing, by means of chemical peptide synthesis, a 
TTSE, subsequently linking covalently at least one 
heterologous moiety to the TTSE, and thereafter the 
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isolating the resulting monomer polypeptide construct 
from the reaction mixture, 

and optionally subjecting the monomer polypeptide construct 
to further processing. 

5 The nucleic acid fragment which is mentioned above is also a 
part of the invention and is defined as a nucleic acid frag- 
ment in isolated form which encodes a TTSE as defined herein 
or which encodes the polypeptide part of a monomer 
polypeptide construct according to the invention, with the 
10 proviso that the nucleic acid fragment is different from one 
that encodes native members of the tetranectin family, and 
that the nucleic acid fragment is different from one that 
encodes any of the fusion proteins CIIH6FXTN123 , H6FXTN123, 
H6FXTN12, H6FXTN23, the sequences of which are shown in SEQ 

15 ID NOS: 24-27 . 

The above mentioned host cell (which is also a part of the 
invention) can be prepared by traditional genetic engineering 
techniques which comprises inserting a nucleic acid fragment 
{normally a DNA fragment) encoding the polypeptide part of a 

20 monomer polypeptide construct of the invention into a suit- 
able expression vector, transforming a suitable host cell 
with the vector, and culturing the host cell under conditions 
allowing expression of the polypeptide part of the monomer 
polypeptide construct. The nucleic acid fragment encoding the 

25 polypeptide may be placed under the control of a suitable 
promoter which may be inducible or a constitutive promoter. 
Depending on the expression system, the polypeptide may be 
recovered from the extracellular phase, the periplasm or from 
the cytoplasm of the host cell. 

30 Suitable vector systems and host cells are well-known-in the 
art as evidenced by the vast amount of literature and 
materials available to the skilled person. Since the present 
invention also relates to the use of the nucleic acid frag- 
ments of the invention in the construction of vectors and in 
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host cells, the following provides a general discussion 
relating to such use and the particular considerations in 
practising this aspect of the invention. 

In general, of course, prokaryotes are preferred for the 
5 initial cloning of nucleic sequences of the invention and 
constructing the vectors useful in the invention. For 
example, in addition to the particular strains mentioned in 
the more specific disclosure below, one may mention by way of 
example, strains such as E. coli K12 strain 294 (ATCCNo. 
10 31446), E. coli B, and E. coli X 1776 (ATCCNo. 31537). These 
examples are, of course, intended to be illustrative rather 
than limiting. 

Prokaryotes are also preferred for expression, since effi- 
cient purification and protein refolding strategies are 
15 available. The aforementioned strains, as well as E. coli 
W3110 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli 
such as" Bacillus subtilis, or other enterobacteriaceae such 
as Salmonella typhimurium or Serratia marcesans, and various 
Pseudomonas species may be used. 

20 In general, plasmid vectors containing replicon and control 
sequences which are derived from species compatible with the 
host cell are used in connection with these hosts . The vector 
ordinarily carries a replication site, as well as marking 
sequences which are capable of providing phenotypic selection 

25 in transformed cells. For example, E. coli is typically 

transformed using pBR322, a plasmid derived from an E. coli 
species (see, e.g., Bolivar et al . , 1977). The pBR322 plasmid 
contains genes for ampicillin and tetracycline resistance and 
thus provides easy means for identifying transformed cells. 

30 The pBR plasmid, or other microbial plasmid or phage must 

also contain, or be modified to contain, promoters which can 
be used by the microorganism for expression. 

Those promoters most commonly used in recombinant DNA con- 
struction include the B-lactamase (penicillinase) and lactose 
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promoter systems (Chang et al . , 1978; Itakura et al . , 1977; 
Goeddel et al . , 1979) and a tryptophan (trp) promoter system 
(Goeddelet al., 1979; EPO Appl. Publ . No. 0036776). While 
these are the most commonly used, other microbial promoters 
5 have been discovered and utilized, and details concerning 
• their nucleotide sequences have been published, enabling a 
skilled worker to ligate them functionally with plasmid 
vectors (Siebwenlist et al . , 1980). Certain genes from proka- 
ryotes may be expressed efficiently in E. coli from their own 
10 promoter sequences, precluding the need for addition of 
another promoter by artificial means. 

In addition to prokaryotes, eukaryotic microbes, such as 
yeast cultures may also be used. Saccharomyces cerevisiase, 
or common baker's yeast is the most commonly used among 
15 eukaryotic microorganisms, although a number of other strains 
are commonly available . For expression in Saccharomyces, the 
plasmid YRp 7, for example, is commonly used (Stinchcomb et 
al., 1979; Kingsman et al . , 1979; Tschemper et al . , 1980). 
This plasmid already contains the trpl gene which provides a 
20 selection marker for a mutant strain of yeast lacking the 
ability to grow in tryptophan for example ATCC No. 44076 or 
PEP4-1 (Jones, 1977) . The presence of the trpl lesion as a 
characteristic of the yeast host cell genome then provides an 
effective environment for detecting transformation by growth 
25 in the absence of tryptophan. 



Suitable promoting sequences in yeast vectors include the 
promoters for 3-phosphoglycerate kinase (Hitzman et al., 
1980) or other glycolytic enzymes (Hess et al., 1968; Holland 
et al., 1978), such as enolase, glyceraldehyde-3 -phosphate 

30 dehydrogenase, hexokinase, pyruvate decarboxylase, phospho- 
fructokinase, glucose- 6 -phosphate isomerase, 3-phosphoglyce- 
rate mutase, pyruvate kinase, triosephosphate isomerase, 
phosphoglucose isomerase, and glucokinase. In constructing 
suitable expression plasmids , the termination sequences 

35 associated with these genes are also ligated into the expres- 
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sion vector 3' of the sequence desired to be expressed to 
provide polyadenylation of the mRNA and termination. 

Other promoters, which have the additional advantage, of 
transcription controlled by growth conditions are the promo- 
5 ter region for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen 
metabolism, and the aforementioned glyceraldehyde-3 -phosphate 
dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Any plasmid vector containing a yeast- 
10 compatible promoter, origin of replication and termination 
sequences is suitable. 

In addition to microorganisms, cultures of cells derived from 
multicellular organisms may also be used as hosts. In prin- 
ciple, any such cell culture is workable, whether from verte- 

15 brate or invertebrate culture. However, interest has been 
greatest in vertebrate cells, and propagation of vertebrate 
in culture (tissue culture) has become a routine procedure in 
recent years (Tissue Culture, 1973) . Examples of such useful 
host cell lines are VERO and HeLa cells, Chinese hamster 

20 ovary (CHO) cell lines, and W13 8, BHK, COS-7 293 and MDCK 
cell lines 

Expression vectors for such cells ordinarily include (if 
necessary) an origin of replication, a promoter located in 
front of the gene to be expressed, along with any necessary • 
2 5 ribosome binding sites, RNA splice sites, polyadenylation 
site, and transcriptional terminator sequences. 

For use in mammalian cells, the control functions on the 
expression vectors are often provided by viral material , For 
example, commonly used promoters are derived from polyoma, 
30 Adenovirus 2, and most frequently Simian Virus 4 0 (SV40) . The 
early and late promoters of SV40 virus are particularly 
useful because both are obtained easily from the virus as a 
fragment which also contains the SV40 viral origin of repli- 
cation (Fiers et al . , 1978). Smaller or larger SV40 fragments 
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may also be used, provided there is included the approximate- 
ly 2 50 bp sequence extending from the Hindi I I site toward the 
Bgll site located in the viral origin of replication. Fur- 
ther, it is also possible, and often desirable, to utilize 
5 promoter or control sequences normally associated with the 
desired gene sequence, provided such control sequences are 
compatible with the host cell systems. 

An origin of replication may be provided either by construc- 
tion of the vector to include an exogenous origin, such as 
10 may be derived from SV40 or other viral (e.g., Polyoma, 
Adeno, VSV, BPV) or may be provided by the host cell 
chromosomal replication mechanism. If the vector is inte- 
grated into the host cell chromosome, the latter is often 
sufficient. 

15 Upon production of the polypeptide monomer constructs- it may 
be necessary to process the polypeptides further, e.g. by 
introducing non-proteinaceous fiinctions in the polypeptide, 
by siabjecting the material to suitable refolding conditions 
(e.g. by using the generally applicable strategies suggested 

20 in WO 94/18227) , or by cleaving off undesired peptide moie- 
ties of the monomer (e.g. expression enhancing peptide frag- 
ments which are undesired in the end product) . 

In the light of the above discussion, the methods for recom- 
binant ly producing the monomer polypeptide construct of the 
25 invention are also a part of the invention, as are the 
vectors carrying and/or being capable of replicating the 
nucleic acids according to the invention in a host cell or a 
cell-line. According to the invention the expression vector 
can be e.g. a plasmid, a cosmid, a minichromosome, or a 
30 phage. Especially interesting are vectors which are inte- 
grated in the host cell /cell line genome after introduction 
• i in the host . 

Another part of the invention are transformed cells (useful 
in the above-described methods) carrying and capable of 
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replicating the nucleic acid fragments of the invention; the 
host cell can be a microorganism such as a bacterium, a 
yeast, or a protozoan, or a cell derived from a multicellular 
organism such as a fungus, an insect cell, a plant cell, or a 
5 mammalian cell . Especially interesting are cells from the 

bacterial species Escherichia, Bacillus and Salmonella, and a 
preferred bacterium is E. coli. 

Yet another part of the invention relates to a stable cell 
line producing, the polypeptide part of a monomer polypeptide 
10 construct according to the invention, and preferably the cell 
line carries and expresses a nucleic acid of the invention. 

On the basis of the above discussions it will be clear to the 
skilled person that also the oligomers resulting from the 
complex formation between the monomer constructs of the 

15 invention are important parts of the invention. Hence the 

invention also pertains to an oligomer which is comprised of 
two monomer polypeptide constructs according to the invention 
which comprises at least three TTSEs> or which is comprised 
of three monomer polypeptide constructs according to the 

20 invention which each only contain one single TTSE. 

As is explained herein and shown in the examples, the oligo- 
mers of the invention are stable at temperatures up to 70 
and it is therefore especially preferred that the oligomers 
of the invention are stable at temperatures above physiologi- 
25 cal ones, e.g. that the oligomers are stable in the tempera- 
ture range 50-70^C. 

Also a part of the invention is a method for preparing a 
dimeric oligomer of the invention which comprises 

admixing a monomer polypeptide construct which includes 
30 two TTSEs (construct 1) with a monomer polypeptide con- 

struct which includes only one TTSE (construct 2) , 
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effecting the two TTSE^s of construct 1 to complex with 
the TTSE of construct 2 (this can be done by thermal 
treatment, i.e. heating to a temperature which ensures 
denaturation followed by subsequent cooling allowing 
5 renaturation, or this can be done by dena- 

turing/renaturing effected by changes in the chemical 
environment) , and 

isolating the resulting dimer and optionally subjecting 
the dimer to further processing (cf . the above discussion 
10 of further processing, but it should also be mentioned 

that the further processing could include non-covalent 
coupling of interesting and relevant moieties to the 
dimeric oligomer) . 

Consequently, the method for producing a trimeric oligomer is 
15 also a part of the invention and comprises the steps of 

admixing three monomer polypeptide constructs of the 
invention with each other, 

effecting complex formation between one TTSE of each 
monomer polypeptide construct, and 

20 - isolating the resulting trimer and optionally subjecting 
the trimeric oligomer to further processing. 

The considerations applying to complex formation and further 
processing mentioned above apply to this method also. 

In view of the detailed discussion above of the I*pick-and- 
25 mix'* aspect of the invention, the invention also pertains to 
a kit comprising 

a first package comprising at least one container means, 
each at least one container means containing a monomer 
polypeptide construct of the invention. 
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a second package comprising at least one container means, 
each at least one container means in the second package 
containing a monomer polypeptide construct of the inven- 
tion, the second package being different from the first 
5 . package with respect to choice and/or number of monomer 
polypeptide constructs included therein, and optionally 

a third package comprising at least one container means, 
each at least one container means in the third package 
containing a monomer polypeptide construct of the inven- 
10 'tion, the second package being different from the first 

and second packages with respect to choice and/or number 
of monomer polypeptide constructs included therein. 

It is preferred that the at least one container means in each 
package contains mutually distinct monomer polypeptide con- 
15 structs, and it is especially preferred that all container 
means comprised in the kit comprises mutually distinct 
polypeptide constructs. 

A very important aspect of the invention is the possibility 
of generating a system designed especially for the individual 
20 circumstances. The basic idea is that the artificial 

selection of heterologous moieties and optionally active 
components, and functional entities result in a unique system 
as will be further disclosed in the following. 

25 Using the TTSE as a vehicle for assembling monovalent scFv or 
Fab antibody fragments into oligomeric and multivalent 
entities offer design advantages also in terms of generating 
chimaeric artificial antibodies having desirable 
pharmacokinetic and pharmacodynamic properties. Small 

3 0 derivatives like monomeric scFv fragments or bivalent 

"minibodies" are rapidly cleared from the circulatory system, 
whereas complete Igs stay for very much longer. Conversely, 
small derivatives like scFv and minbodies exhibit better 
extravasation properties. It is therefore expected that 

35 antibodies of a desired specif icity may be optimized for ' . 
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particular diagnostic or therapeutic needs by engineering the 
pharmacological properties, using the TTSE as a vehicle for 
controlled oligomerization of e.g. scFv fragments. 

One example of such engineering would be the requirements for 
5 delivering a high dose of an imaging or toxin- conjugated 
antibody to a tumour, while ensuring as low a systemic 
exposure or imaging background as possible. In such case a 
TTSE conjugated scFv fragment could be designed to exhibit 
strong multivalent binding to the tumour and rapid clearance 
10 of excess conjugate from circulation. 

Accordingly, in af further aspect the present invention also 
relates to the use of a monomer polypeptide construct or to a 
an oligomer according to the present invention as a vehicle 
for assembling antibody fragments into oligomeric or 
15 multivalent entities for generating chimeric artificial 
antibodies having preselected pharmacokinetic and/or 
pharmadynami c pr ope r t i e s . 

The use of specific delivery systems also play an important 
role in connection with the present invention in that such 
20 systems may by utilized with respect to different use of the 
present invention both with respect to the a more general 
therapeutic application and with respect to gene therapy. 
Exampels of suitable drug delivery and targeting systems are 
disclosed in Nature 392 supp.OO april 1998). 

25 Accordingly, efficiency of deliverance may be further 

increased if the delivery system e.g a liposome is supplied 
with a molecular unit, an "inf actor or transfector" ligand, 
recognized by a internalizing receptor unit specific for the 
target cells. For example, cells displaying endocytotic 

3 0 receptors like members of the LDL family of receptors may be 
even more efficiently infected or transfected either by 
including a TTSE unit in the antibody containing heteorimer 
or in an independent TTSE unit conjugated to one or more of 
the domains of the Receptor Associated Protein, RAP, 
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(Ellgaard, L., Holtet; T.L., Nielsen, P.R., Etzerodt / M . , 
Gliemann, J. & Th0gersen, H.C. Eur J Biochem. 1997, vol 244, 
544-551) which is recognized as a ligand to all receptors in 
this abiandant family of endocytos is -mediating receptors . 

5 Accordingly, in a further aspect, the invention is directed 
to the use of a monomer polypeptide construct or to an 
oligomer according to the invention for targeted gene therapy 
involving selective delivery of the material for transfection 
or infection of the specific population of cells. 

10 The ultimate perspective of such TTSE-mediated gene therapy 
would be the deployment of a viral vector that would find no 
other targets in the patient but the cells. displaying the 
artificial receptor complex. 

In a still further aspect, the invention is directed to the 
15 use of a monomer polypeptide construct or to a an oligomer 

construct according to the invention wherein the at least one 
heterologous moity comprises a moiety selected from a ligand 
binding structure such as a receptor molecule or the ligand 
binding part of a receptor molecule, and wherein the gene 
20 therapy involves the delivery of nucleic acids to the desired 
population of cells by use of a viral vector directed to 
cells displaying the artificial receptor complex 
corresponding to the heterologous moity. 

In another aspect, the invention is directed to the use of a 
25 monomer polypeptide construct or to a an oligomer construct 
according to the invention wherein at least one TTSE is 
modifyed with a membrane integrating or associating entity 
having affinity to the specific population of cells in the 
body relevant for the gene therapy. 

3 0 Furthermore, a recent review of the imaging an therapeutic 
potential of a range of known antibody derivatives has been 
published by Paul Carter and Margaret Merchant of Genentech 
Inc. (Current Opinion in Biotechnology, 1997, vol 8, 449- 
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4 54) . In direct continuation of their conclusions it will be 
apparent that oligomer sat ion of antibody derivatives like 
scFv derivatives may extend current technology in the 
designer-antibody field in many important aspects, some of 
5 which will be elaborated below (with reference to the Carter 
& Merchant review) . 

One of the well-known problems inherent to mouse monoclonal 
antibodies that have been 'humanized* by grafting of the 
murine antigen combining site onto a human Ig framework is 

10 that antigenicity of the chimaeric product in human patients 
is often difficult to suppress entirely, resulting in - 
sometimes life-threatening - immune reactions to the 
diagnostic or therapeutic humanized antibody product. The 
risk of such side-effects are expected to be much reduced if 

15 the designer antibody is assembled from purely human proteins 
or protein fragments. Since the TTSE trimerising unit 
described here is identical to a portion of human tetranectin 
that is already present in human plasma and tissue, there is 
good reason to expect that the TTSE will not elicit an 

20 antigenic response in a human subject if it is introduced as 
a component of a chimaeric product that is not otherwise 
antigenic in humans. 

Accordingly, in one aspect, the present invention relates to 
the use of a monomer polypeptide construct or to a an 
25 oligomer according to the present invention as a component of 
a chimaeric product having low antigenicity in humans 
relative to formulations comprising on or more components of 
non-human origin. 

Carter & Merchant further review present technology for 
30 radiolabelling of antibody derivatives. Again, 

oligomerisation using TTSEs offer more elegant solutions to 
problems associated with labelling, as the TTSE offers the 
possibility to construct one or two of the TTSE monomer units 
in a heterotrimeric complex to harbour the site carrying the 
35 label. Thus, in this format labelling may also be confined to 
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the non-antibody part of the complex, leaving the antigen- 
binding module entirely unmodified, and the complex may 
furthermore be formulated "in the field" as and when needed. 

In many receptor-mediated signal transduction pathways 
5 signals are triggered by the clustering of receptor molecules 
on the cell membrane. The TTSEs therefore have important 
applications in the study and exploitation of receptor 
signalling, as ligands may be presented as oligomers by 
conjugation to a TTSE unit. 

10 This also has important application in phage display 

technologies for discovering new ligands and new receptors as 
the engineering of a TTSE unit fused inline to a candidate 
ligand molecule will allow the display of a hetero-trimeric 
phage coat protein, in which only one of the monomer units is 

15 conjugated to the phage coat protein. This may be 

accomplished by appropriate insertion of amber codons at the 
site of fusion of phage coat protein to the TTSE -ligand 
segment of the three-way fusion protein encoded by the 
recombinant phage. In appropriate E. coli cells the presence 

20 of this amber codon will result in translation termination in 
the majority of read-throughs, and hence most of the fusion 
protein product secreted to the periplasmic compartment in 
the phage-infected bacterium will be soluble TTSE- ligand 
fusion protein, whereas a minority of the fusion protein will 

25 also contain a phage protein module. The majority of trimers 
that will be generated will therefore contain, at most, one 
monomeric unit that will ensure integration (display) in the 
mature recombinant phage particle. 

A further advantage of the display tecnology discribed above 
30 relates to the fact that it is specially useful for selection 
on the basis of a relatively low affinity because of tbe 
entropic benefit contribution obtained by the proximity of 
the tree binding moities in confined spatial arrangement. 
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Accordingly, the present invention in an important aspect, 
also relates to protein library technology wherein the TTSE's 
discribed above are utilized. 

The trimerisation of candidated recombinant ligands is 
5 especially important as, for many receptors, the 

intracellular signal is induced by receptor clustering, which 
is only brought about if the external ligand exhibits 
multivalent binding to the receptor, so as to bridge two or 
more receptor molecules. 

10 In one preferred embodiment the monomer polypeptide construe 
or the oligomer construct according to the invention is for 
targeted gene therapy involving selective delivery of the 
material for transfection or infection of the specific 
population of cells. Tthe at least one heterologous moity may 

15 comprise a moiety selected from a ligand binding structure 
such as a receptor molecule or the ligand binding part of a 
receptor molecule, and wherein the gene therapy involves the 
delivery of nucleic acids to the desired population of cells 
by use of a viral vector directed to cells displaying the 

20 artificial receptor complex corresponding to the heterologous 
moity. 

As mentioned above, it is an important aspect of the 
invention that the monomer polypeptide construct and/or the 
oligomer may be used as a component of a chimaeric product 

2 5 having low antigenicity in humans. As the construe is of 

human origin it is believed that the antigenicity in humans 
is low relative to formulations comprising on or more 
components of non-human origin. 

One primary use of a monomer polypeptide construct or a an 

3 0 oligomer according to the invention is for delivering an 

imaging or toxin-conjugated antibody to a target such as a 
tumor, or use as a vehicle delivering an substance to a 
target cell or tissue, as a vehicle for assembling antibody 
fragments into oligomeric or multivalent entities for 
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generating chimeric artificial antibodies having preselected 
pharmacokinetic and/or pharmadynamic properties. 

The substance in question being one or more selected from the 
group of heterologous moities as well a pharmaceutical. Also 
5 a labelled construct wherein the label is coupled to one or 
to of the TTSE monomer units is within the scope of the 
invention. 

As explained in detail previously, an important and 
surprising use of the monomer polypeptide construct or the 
10 oligomer according to. the pesenent invention is for protein 
library technology, such as phage display technology. The 
present invention also relates to any polynucleotid molecule 
such as a RNA, DNA or PNA as well as any vector encoding one 
or more TTSE. 

15 A further use according to invention includes preparation and 
use of a pharmaceutical composition comprising the TTSE 
construct and optionally a pharmaceutical ly acceptable 
excipient. The composition may be administered by a route 
selected from the group consisting of the intraveneous route, 

20 the intraarterial route, the . transmembraneus route of the 

buccal, anal, vaginal or conjunctival tissue, the intranasal 
route, the pulmonary route, the transdermal route, the 
intramuscular routed, subcutaneous route, intratechal route, 
inoculation into tissue such as a tumour, or by an implant. 

2 5 The the monomer polypeptide construct or the oligomer is in a 
preferred embodiment comprised in a liposome. 

It is obvious from the disclosure of the present invention 
that the treating or preventing of a disease may by a further 
aspect comprising administering to the subject in need 
30 thereof an effective amount of a pharmaceutical composition 
referred to above. 
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In one aspect of the various possibilities according to the 
present invention concerning how the human gene therapy is 
targeted, includes the case wherein at least one TTSE is 
modified with a membrane integrating or associating entity 
5 having affinity to the specific population of cells in the 
body relevant for the gene therapy. 

As used in the conventional pharmaceutic field the present 
invention includes a method wherein the monomer polypeptide 
construct or the oligomer is administered by a route selected 

10 from the group consisting of the intraveneous route, the 

intraarterial route, the transmembraneus route of the buccal, 
anal og vaginal tissue, intranasal route, the pulmonary 
route, the transdermal route, intramuscular, subcutaneous, 
intratechal, the buccal, inoculation into tissue such as a 

15 tumour, or by an implant. 

Finally, the present invention is also relating to the field 
of dianosing as the skilled person would easyly recognice, 
that the TTSE disclosen in the present inventiion may also 
refer to a method for diagnosis comprising a construct 
2 0 comprising the monomer polypeptide construct or the oligomer, 
together with a diagnosing component coupled thereon. 

EXAMPLE 1 

Design and construction of the pTHStrip E. coli expression 
vectors for the production of trimerised chimeric fusion 
25 proteins. 

The plasmid clone pT7H6FXTN123 (Example 2) was used as tem- 
plate for amplification in two Polymerase Chain Reactions 
(PGR) (Saiki et al., 1988) with the primer pairs trip-N (SEQ 
ID NO: 1) and trip-Ca (SEQ ID NO: 2) and trip-N (SEQ ID NO: 
30 1) and trip-Cb (SEQ ID NO: 3), respectively. The amplified 
DNA fragments, trioa , comprising nucleotide sequences enco- 
ding an IQGR cleavage site for the restriction protease EX, 
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• (SEQ ID NO: 4) followed by two sites -for the restriction 
nucleases Bglil and Kpnl,.the nucleotide sequence encoding 
the tetranectin polypeptide sequence for Glu l to Lys 52 (SEQ 
ID NO: 5) followed by recognition sites for the three re- 
5 strict ion endonucleases BamHI, Hindi II, and EcoRI, respec- 
tively, and triDb > comprising nucleotide sequences encoding 
an IQGR cleavage site for the restriction protease EX, (SEQ 
ID NO: 4) followed by two sites for the restriction nucleases 
Bglll and Kpnl, the nucleotide sequence encoding the tetra- 
10 nectin polypeptide sequence for Glu 1 to Val 49 (SEQ ID NO: 
6) followed by recognition sites for the three restriction 
endonucleases BamHI, Hindlll, and EcoRI, respectively, were 
subcloned into the plasmid pT7H6 (Christensen e t al., 1991), 
yielding pTtripa and pTtripb, respectively (Figs. 3 and 4) , 

15 EXAMPLE 2 

Tetranectin, localisation of the trimerising structural 
element and stability of the triple alpha helical coiled 
coil. 

The cDNA encoding the reading frame corresponding to the 

20 mature tetranectin single chain (SEQ ID NO: 7) was cloned by 
specific amplification in a Polymerase Chain Reaction (PGR) 
(Saikiet al., 1988) of the nucleotide sequences from amino 
acid residue Glul to VallSl using 1^^ strand oligo-dT primed 
cDNA synthesized from total human placental RNA as template. 

25 Primers used in the PGR were SEQ ID NO: 8 and SEQ ID NO: 9. 
RNA extraction and cDNA synthesis were performed using stan- 
dard procedures. The amplified reading frame encoding the 
monomer subunit of tetranectin was at the 5' -end, via the 
PCR-reaction, linked to nucleotide sequences encoding the 

30 amino acid sequence SEQ ID NO: 10 which constitute an lEGR 
cleavage site for the bovine restriction protease EX, (Nagai, 
and Thogersen, 1987) . A glycine residue was, due to the 
specific design of the 5' -PGR primer (SEQ. ID NO. 8), 
inserted between the G-terminal arginine residue of the EK, 

35 cleavage site (SEQ ID NO. 10) and the tetranectin Glul- resi- 
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due. The amplified DNA fragment was subcloned into the E. 
coli expression vector pT^H^ (Christensen et al. , 1991) 
producing the plasmid PT7H6.FX-TNI23 expressing the tetranec- 
tin monomer H6FXTN123 (SEQ ID NO: 25) and into pT^CIIHg, 

5 which is a derivative of t>TrjH^, where the amino-terminal 32 
amino acid residues of the lambda CII protein (SEQ ID NO. 11) 
are inserted 5' of the six histidine residues (SEQ ID NO. 12) 
as outlined in Fig. 5, yielding PT7CIIH6FX-TNI23 expressing 
the tetranectin fusion protein CIIH6FXTN123 (SEQ ID NO: 24) . 

10 The amino acid sequence of the expressed proteins are shown 
in Fig. 6 (in SEQ ID NO: 7 is given the amino acid sequence 
of the mature tetranectin protein) . Furthermore three addi- 
tional derivatives of tetranectin were constructed (Fig. 8) : 
H6FXTN12 comprising the tetranectin amino acid residues Glul 

15 to Val4 9 (SEQ ID NO: 6) , H6FXTN2 3 comprising the tetranectin 
amino acid residues VallV to VallSl (SEQ ID NO: 7) , and 
H6FXTN3 (SEQ ID NO: 30) comprising the tetranectin amino acid 
residues Ala45 to VallSl (SEQ ID NO: 7 ) . These three tetra- 
nectin derivatives were constructed by specific amplification 

20 in a PGR using pT7HgFX-TN123 as template and the. primer-pairs 
SEQ ID NO: 8 and SEQ ID NO: 13, SEQ ID NO: 14 and SEQ ID NO: 
9, and SEQ ID NO: 15 and SEQ ID NO: 9, respectively. The 
amplified DNA fragments were subcloned into the E. coli 
expression vector pT^H^ producing the plasmids pT7HgFX-TN12 , 

25 pT7HgFX-TN23, and pT^HgFX-TNS , respectively (Fig. 7) . 

To prepare recombinant tetranectin and its derivatives, each 
of the plasmids PT7H6FX-TNI23 , pT7CIIHgFX-TN123 , PT7H6FX-TNI2 , 
pT7HgFX-TN23, and pT7HgFX-"TN3 were grown in medium scale (4x 
1 litre; 2xTY medium, 5 mM MgS04 and 100 fig ampicillin) in E. 
30 coli BL21 cells, as described by Studier et al . (1990) . 
Exponentially growing cultures at 3 7°C were at cn„ 0.8 
infected with bacteriophage lambda CE6 at a multiplicity of 
approximately 5. Cultures were grown at 37<>C for another 
three hours and the cells harvested by centrifugation. 

35 Cells were resuspended in 150 ml of 0.5 M NaCl, 10 mM 

Tris-HCl pH 8, and 1 mM EDTA pH 8. Phenol (100 ml adjusted to 
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pH 8) was added and the mixture sonicated to extract the 
total protein. Protein was precipitated from the phenol phase 
by 2.5 volumes of ethanol and centrifugation. 

The protein pellet was dissolved in a buffer containing 6M 
5 guanidinium chloride, 50 mM Tris-HCl pH 8 and 0.1 M dithio- 
erythriol. Following gel f iltratipn on Sephadex G-25 (Pharma- 
cia, Sweden) into 8 M Urea, 1 M NaCl, 50 mM Tris-HCl pH 8 and 
10 mM 2-mercaptoethanol, the crude protein preparation was 
applied to a Ni^* activated NTA-agarose column (Ni^^NTA-aga- 
10 rose, 75 ml pre-washed with 8 M urea, 1 M NaCl, 50 mM 

Tris-HCl pH 8, and 10 mM 2-mercaptoethanol) for purification 
(Hochuli et al., 1988) and refolding of the fusion proteins, 
H6PXTN123, CIIH6FXTN123, H6FXTN12, H6FXTN23, and' H6FXTN3 . 

For this study we chose to prepare our own Ni^'^NTA-agarose 
15 matrix. A carbodiimide coupling of the N- (5 -amino- 1- car boxy- 
pentyl) iminodiacetic acid metal ligand (synthesis route as 
described by Dobeli & Hochuli (EP-A-O 253 303)) to a rigid 
agarose matrix (Sepharose CL-GB, Pharmacia, Sweden) was 
performed: 

20 8 g of N- (5 -amino- 1-carboxypentyl) iminodiacetic acid from the 
synthesis procedure in 50 ml was adjusted to pH 10 by addi- 
tion of 29 g of Na2CO3(10 HjO) and added to a stirred suspen- 
sion of activated Sepharose CL-GB in 1 M Na2C03 . Reaction was 
allowed overnight. The Sepharose CL-GB (initially 100 ml 

25 suspension) was activated after removal of water by acetone 
with 7 g of 1, 1' -carbonyldiimidazol under stirring' for 15 to 
30 min. Upon activation the Sepharose CL-GB was washed with 
acetone followed by water and 1 M Na2C03 . 

The NTA-agarose matrix was loaded into a column and "charged" 
30 with Ni^"^ by slowly passing through 5 column volumes cf a 10% 
NiS04 solution. The amount of Ni^+ on the NTA-agarose matrix, 
prepared by this procedure, has been determined to 14 ptmol 
per ml matrix. After charging the Ni^^NTA-agarose column was 
washed with two column volumes of water, one column volume of 
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1 M Tris-HCL pH 8 and two column volumes of loading buffer 
before stirred mixing of the Ni^^NTA-agarose matrix with the 
crude protein extracts in a breaker for 15 to 3 0 min. All 
buffers prepared for liquid chromatography were degassed 
5 under vacuum prior to addition of reductant and/or use . 

The Ni^^NTA- agarose matrix - crude extract mixture was packed 
in standard glass columns for liquid chromatography (internal 
diameter: 2.6 cm) to a volume of approximately 40 ml. The 
columns were washed with 200 ml of 8 M urea, i M NaCl, 50 mM 

10 Tris-HCl pH 8, and 10 mM 2-mercaptoethanol (Buffer I) and 100 
ml 6 M guanidinium chloride, 50 mM Tris-HCl pH 8 and 10 mM 
2-mercaptoethanol (Buffer II) and the adsorbed tetranectin 
derived fusion proteins H6PXTN123, H6CIIFXTN123 , H6FXTN23, 
and H6FXTN3 refolded using the cyclic refolding procedure as 

15 described (Thogersen et al., WO 94/18227) . 

The fusion protein H6FXTN12 was refolded by removing the 
guanidinium chloride and 2-mercaptoethanol of buffer II in a 
gradient over 5 column volumes into 50 mM Tris-HCl pH 8 and 
0 . 5 M NaCl . After cotnpletion of the refolding procedures the 
20 tetranectin derived fusion proteins were eluted from the 

Ni^+NTA- agarose columns with a buffer containing 0.5 M NaCl, 
50 mM Tris-HCl, 25 mM EDTA pH 8. The tetranectin fusion 
proteins H6FXTN12 3, H6FXTN23, and H6FXTN3 were cleaved with 
EX, at 4«'C overnight in a molar ratio of 1:3 00. After FX, 
25 cleavage the protein samples were concentrated 10 fold by 
ultrafiltration on YMIO membranes (Amicon) . After ten times 
dilution of the protein sample with 2 mM CaCla, the recombi- 
nant tetranectin derivatives TN123, TN23, and TN3 were iso- 
lated by ion-exchange chromatography on Q-Sepharose (Pharma- 
30 cia, Sweden) in a linear gradient over 10 column volumes from 
10 mM Tris-HCl pH 8 , 2 mM CaClj to 10 mM Tris-HCl pH 8 , 2 mM 
CaCl2. and 0.5 M NaCl. After elution from the Ni^+NTA-agarose 
columns the fusion proteins H6CIIPXTN123 and H6PXTN12 were 
likewise concentrated 10 fold by ultrafiltration on YMIO 
35 membranes and gelfiltrated into buffer containing 25 mM Tris- 
HCl pH 8, 25 mM NaCl, and 2 mM CaClj, before purification of 
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correctly folded monomer by ion-exchange chromatography on 
Q-Sepharose as described. 

Recombinant full length tetranectin (TN123) produced by these 
procedures have been analyzed with respect to binding to 
5 plasminogen kringle 4 and immobilised fucoidan, expression of 
antigenic sites, and localization of disulphide bridges. In 
all criteria tested the produced TN123 behaved identically to 
isolated naturally human tetranectin (data not shown) .. Fur- 
thermore TN123 and TN3. have been crystallized (Kastrup et 
10 al.,1996) and the structure has also been determined, all of 
which bear evidence that a single unique and biologically 
active folded product had indeed been produced. 

Analytical grel filtration analysis of rTN proteins. 

Analytical gelf iltration of the recombinant tetranectin 
15 derivatives TN123, TN3 , and TN23 (Fig. 9) were performed on a 
Superose 12 HR 10/30 column (Pharmacia, Sweden) with a total 
volume of 25 ml in 100 mM NaCl and 50 mM Tris-HCl pH 8 and a 
flow rate of 0.2 ml/min. The K^v value is defined by, 
Kav={Ve-Vo)/ (Vc-Vo). 

20 The gelf iltration analysis of TN123 and TN23 show that both 
proteins are exclusively found as trimers in solution (k^^ 
values of 0.2 7 and 0.29, respectively), whereas TN3 appeared 
monomeric ( K^^ : 0 . 4 1) , 

Chemical cross-linking of tetranectin and derivatives 

25 The recombinant tetranectin derivatives TN123, TN3, and TN23, 
together with the fusion proteins CIIH6FXTN123 and H6FXTN12 
or mixtures of these derivatives at l mg/ml concentrations in 
cross-linking buffer (O.lM Sodium borate, pH 9.1) were 
incubated with dimethylsuberimidate (DMSI, Sigma) . 10 ptl 

30 aliquots of protein solution were incubated with 1 fil ali- 
quots of DMSI stock solution (20 mg/ml in cross-linking buf- 
fer) for 30 minutes at 25'=*C before addition of 2 ill quenching 
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buffer (3 M Tris-HCl, pH 9) . Subunit exchange between pre- 
formed homo-oligomers was induced by subjecting protein 
mixtures to heat shock treatment. Five ^1 aliquots of each 
protein solution (1 mg/ml stocks) were mixed at 0°C in stan- 
5 dard polypropylene microcentrifuge tubes, transferred to a 
water bath at 70 °C for the time spans indicated, and then 
further incubated for 15 minutes at 25 before reaction with 
DMSI . 

10 Prior to analysis by SDS-PAGE (12% gels) of the cross-linked 
products the reaction samples were boiled in the presence of 
SDS and mercaptoethanol . 

Cross-linking analysis of TN12 3 and the fusion protein 
CIIH6FXTN123 showed that no detectable subunit exchange 
15 between pre -formed homo-oligomers in a mixture of TNI 2 3 and 
CIIH6FXTN123 was found after 16 hours at room temperature 
(Fig. 10) . Subunit exchange could be induced by incubating 
the protein mixture at 7 0°C for 15 seconds or longer before 
cooling to room temperature and addition of DMSI . SDS-PAGE 
20 analysis showed the presence of four t rimer bands above 95 
kDa (corresponding to two homo-trimers and two hetero-tri- 
mers) and three dimer bands (corresponding to two homo-dimers 
and one hetero-dimer) in the gel between 43 and 55 kDa, in a 
relative abundance in agreement with random association of 
25 monomer subunits into trimers after subunit exchange. It 

should be noted, that molecular weight markers have only been 
included on the SDS-PAGE gels for crude calibration and 
orientation of the gels. 

The trimeric organization of tetranectin was further corro- 
30 borated by cross-linking studies of the proteins H6FXTN12, 

TN23, and TN3 and mixtures between them (Fig. 11) . The tetra- 
nectin derivative TN3, containing only the CRD, could not be 
cross-linked even at high protein concentrations and did not 
interfere with the cross-linking of rTN123 , Likewise, the 
35 derivative TN23, containing exon 2 and the CRD, appeared 

monomeric after cross- linking and was found not to interfere 
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with trimerisation of TN123 during subunit exchange. Dimeric 
TN23 molecules found at low abundance in the sample probably 
reflects contaminating misfolded disulphide bridged dimers. 
The fusion protein H6FXTN12 formed homo.-trimers upon cross - 
5 linking and generated hetero-trimers with TN123 after subunit 
exchange. Because of the difference in size of full length 
tetranectin {TN123) and H6FXTN12 the possible nine protein 
bands resulting from chemical cross-linking are: The four 
trimers [{TN123)3, (TN123) 2 (H6PXTN12) , (TN123) (H6PXTN12)2, 
10 and (H6PXTN12)3] at approx. 95 kDa , 50 kDa, 37 kDa, and 20 
kDa, respectively; the three dimers [(TN123)2i 
(TN123) (H6FXTN12) , and (H6FXTN12)2] at approx. 45 kDa, 30 
kDa, and 15 kDa^ respectively; and the two monomers TN123 at 
23' kDa and H6FXTN12 at. 9 kDa. . 

15 Taken together, the gel filtration and the cross-linking 

analysis of the tetranectin derivatives show that tetranec- 
tin, like the collectin group of C-type lectins, is a 
trimeric molecule and that amino acid residues directly shown 
to be involved in trimerisation of the tetranectin monomer 

20 are located in axon 2 of the protein (Vall7 - yal49) . Fur- 
thermore subunit exchange between the trimeric molecules 
could only be observed after heat denaturation. Amino acid 
residues Glul to Aspl6 of tetranectin are critical to chemi- 
• cal cross -linking with DMSI and more important appear to 

25 stabilize the trimeric molecule because the cross-linking 

analysis of the mixture TN123 and TN23 showed no decrease in 
TN123 formation after heat denaturation and possible subunit 
exchange (Fig. 11) . The stability of the tetranectin trimer 
was corroborated by a cross-linking analysis with DMSI at 

30 different temperatures. Fifteen fil TN123 at 0.3 mg/ml concen- 
tration was pre-incubated 10 min. at either 37^C, 50^C, 60^0, 
or 70^C before addition of 2 ptl DMSI (20 mg/ml) . The reaction 
was allowed to proceed for 15 min. before reaction was 
quenched with 5 (xl of 3M Tris-HCl pH 9.1 and the reaction 

35 mixtures allowed to cool to room temperature. SDS-PAGE analy- 
sis of reduced samples (Fig. 12) showed, that trimers are 
readily detectable even at 60^ C\ although a competing pattern 
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of cross-linking specimens increases at increasing tempera- 
tures. The appearance of other cross-linking specimens is 
probably due to the unfolding of the CRD. The stability of 
the tetranectin trimerising structural element is further 
5 analyzed using a designed chimeric protein in Example 3 . 

EXAMPLE 3 

Design and construction of the recombinant chimeric protein 
TRIPB-UB - the tetranectin trimerising structural element and 
ubiquitin. 

10 A plasmid clone, pLCMHP/UB, generously provided by Dr. O . 
Wiborg harbouring a human ubiquitin cDNA insert (SEQ ID: 16) 
was used as template and SEQ ID NO: 17 together with SEQ ID 
NO: 18 were used for amplification in a Polymerase Chain 
Reaction (PGR) (Saiki et al . , 1988) of the nucleotide 

15 sequence encoding amino acid residue Ilel to Gly76 of human 
ubiquitin (SEQ ID: 19) . The amplified DNA fragment was after 
digestion with the restriction endonucleases BamHI and 
Hindlll ligated into the BamHI and Hindlll sites of pTtripb 
(Example 1) yielding pTtripb>UB (Fig. 13) using standard 

20 procedures. 

To prepare the chimeric fusion protein H6FXtripb-UB (Fig. 14, 
SEQ ID NO: 31) the plasmid pTtripb-UB was grown in medium 
scale (4X1 litre/ 2xTY medium, 5 mM MgS04 and 100 jxg 
ampicillin) in E. coli BL21 cells, as described by Studier et 

25 al. (1990) .Exponentially growing cultures at 37^C were at 
GD,,, 0.8 infected with bacteriophage lambda CE6 at a multi- 
plicity of approximately 5. Cultures were grown at 37°C for 
another three hours and the cells harvested by 
centrifugation. Cells were resuspended in 150 ml of 0.5 M 

30 NaCl, 10 mM Tris-HCl pH 8, and 1 mM EDTA pH 8. Phenol (100 ml 
adjusted to pH 8) was added and the mixture sonicated to 
extract the total protein. Protein was precipitated from the 
phenol phase by 2.5 volumes of ethanol and centrifugation. 
The protein pellet was dissolved in a buffer containing 6 M 
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guanidinium chloride, 50 mM Tris-HCl pH 8 and 0.1 M dithio- 
erythriol. Following gel filtration on Sephadex G-25 (Pharma- 
cia, Sweden) into 8 M Urea, l M NaCl, 50 mM Tris-HCl pH 8 and 
10 mM 2-mercaptoethanol , the crude protein preparation was 
5 applied to a Ni^"^ activated NTA-agarose column for purifi- 
cation (Hpchuli et al.. 1988) and refolding of the fusion 
protein H6FXtripb-UB . 

Synthesis and charging of the Ni^**" activated NTA-agarose 
matrix is described in Example 2 . All buffers for liquid 

10 chromatography were degassed prior to use. The fusion protein 
H6FXtripb-UB was refolded by removing the urea and 2-mercap- 
toethanol from buffer 11 in a gradient over 5 column volumes 
into 50 mM Tris-HCl pH 8 and 0.5 M NaCl . After completion of 
the refolding procedure the H6FXtripb-UB fusion protein was 

15 eluted from the Ni^^NTA- agarose columns with a buffer con- 
taining 0.5 M NaCl, 50 mM Tris-HCl, 25 mM EDTA pH 8 and gel 
filtrated on a Sephadex G50 column (Pharmacia) into 0.1 M 
Sodium borate pH 9 buffer for chemical cross-linking analysis 
with DMSI. 

20 The cross-linking analysis experiment was designed both to 
analyze the oligomeric status of the chimeric fusion protein 
and the thermal stability of the presumed fusion protein 
trimer as described in Example 2: Samples of 15 ^1 H6FXtripb- 
UB fusion protein, at approximately l.omg/ml concentration, 

25 were pre-incubated 10 min. at either 37°C, 50*^C, 60^C, or ' 
70°C before addition of 2 ixl DMSI (20mg/ml) . The reactions 
were allowed to proceed for 15 min. before quenching by 
addition of 5 /zl of 3 M Tris-HCl pH9 . 1 and the reaction 
mixtures were allowed to cool to room temperature. SDS-PAGE 

30 analysis of reduced samples (Fig. 12) showed, (1) that the 

fusion protein H6FXtripb-UB is a trimer in solution (monomer 
at 17 kDa, dimer at 3 5 kDa, and trimer at 43 kDa) and (2) 
that a substantial amount of trimer molecules is present even 
at 70*'C. The appearance of other larger cross-linking pro- 

35 ducts is probably due to cross-linking of trimers via the 
ubiquitin part of the fusion protein. 
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EXAMPLE 4 

Design and construction of trimerised and hexamerized CEA6 
scFv antibodies scFv (CEAG) -TRIPB, TRiPB-scFv (CEA6) and 
scFv {Q,EkG)-TRIPB-scFv (CEA6) . 

5 A plasmid clone, pUC19MCH/CEA6 , generously provided by Dr. 
Kevin Pritchard, Cambridge Antibody Technology Ltd., Mel- 
bourn, UK, harbouring a nucleotide sequence (SEQ ID: 20) 
encoding the CEA6 antibody in single-chain Fv (scFv) format, 
followed in sequence by a "myc tag" (which is a general 

10 purification/detection handle) , was used as template in 
Polymerase Chain Reactions (PCR) (Saikiet al., 1988) in 
which the nucleotide sequence encoding the scFv + myc tag was 
amplified using the primer pairs (SEQ ID: 21 and SEQ ID: 22) 
and (SEQ ID: 21 and SEQ ID: 23) to generate PCR fragments "A" 

15 and "B" . 

PCR fragment "A" was treated with restrictions enzymes BamHI 
and Kpnl and the resulting fragment was inserted into 
Bglll/Kpnl cut pTripb (Example l) to obtain the vector 
pTH6PXscPv(CEA6) -tripb (Fig. 15) encoding the H6FXscPv (CEA6) - 

20 TRIPB fusion protein (Fig. 16). PCR fragment "B" was treated 
with restriction enzymes BamHI and Hindlll and the resulting 
fragment was inserted into BamHI and Hindlll cut pTripb 
(Example 1) to obtain the vector pTH6FXtripb-scFv(CEA6) (Fig. 
17) encoding the H6FXTRlPB-scFv(CEA6) fusion protein (Fig . 18 , 

25 SEQ ID NO: 33) using standard procedures . 

To generate the expression vector pTH6FXscFv(CEA6) -tripb- 
scFv(CEA6) (Fig. 19) encoding the HSFXscFv (CEA6) -TRIPB- 
scFv(CEA6) fusion protein (Fig. 20, SEQ ID NO: 34) the insert 
in the vector pTH6FXtripb-scPv(CEA6) was excised using re- 
30 striction enzymes BamHI and Hindlll and inserted into the 
vector pTH6PXscFv(CEA6) -tripb, which had been treated with 
restriction enzymes BamHI and Hindlll. 
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To prepare the chimeric fusion proteins H6FXscFv(CEA6) -TRIPB 
(SEQ ID NO: 32), H6 FXTRI PB - scFv (CEA6 ) (SEQ ID NO : 33) and 
H6FXscFv(CEA6) -TRIPB-scFv(CEA6) (SEQ ID NO: 34) the plasmids 
pTH6FXscFv(CEA6) -TRIPB, pTH6 FXtripb - scFv (CEA6 ) and 
5 pTH6FXscFv(CEA6) -tripb- scFv(CEA6) were grown in small scale 
(1 litre; 2xTY medium, 5 mM MgS04 and 100 ampicillin) in 
E. coli BL21 cells, as described by Studier et al. (1990) . 
Exponentially growing cultures at 37*=*C were at ODgQQ 0.8 
infected with bacteriophage lambda CE6 at a multiplicity of 

10 approximately 5. Cultures were grown at 3 7°C for another 

three hours and the cells harvested by centrifugation. Cells 
were resuspended in 50 ml of 0.5 M NaCl, 50 mM Tris-HCl pH 8, 
and 1 mM EDTA pH 8. Phenol (50 ml adjusted to pH 8) was added 
to each and the mixtures were sonicated to extract total 

15 protein. After clarification by centrifugation (25 minutes at 
10.000 g) crude protein fractions were precipitated from the 
phenol phases by addition of 2.5 volumes of ethanol and 
centrifugation. Protein pellets were dissolved in a buffer 
(15-25 ml) containing 6 M guanidinium chloride, 50 mM 

20 Tris-HCl pH 8 and 0.1 M dithioerythriol . Following gel fil- 
tration on Sephadex G-2 5 (Pharmacia, Sweden) into 8 M Urea, 1 
M NaCl, 50 mM Tris-HCl pH 8 and 10 mM 2-mercaptoethanol , the 
crude protein preparations were applied to Ni^"^ activated 
NTA-agarose columns (75 ml column volume) for purification 

25 (Hochuli et al . , 1988). Washing buffer (6 M guanidine-HCl, 50 
mM Tris-HCl pH 8 and 10 mM 2-mercaptoethanol) was then flowed 
through the columns until stable baselines were obtained. 
Virtually pure fusion proteins could then be eluted by apply- 
ing a pH gradient to each column (1000 ml gradient in 8 M 

30 urea and 10 mM 2-mercaptoethanol obtained by linear (per 

volume) mixing of solutions containing 50 mM sodium di- hydro - 
genphosphate (pH 5 buffer) and 50 mM di-sodium hydrogenphos - 
phate (pH 8 buffer) . 

In preparation for in vitro refolding by the method of Tho- 
35 gersen et al. (WO 94/18227) 20 mg of each purified fusion 
protein were mixed in suspensions in refolding "buffer B" 
(described below) with aliquots of suspensions of Ni^"" acti- 
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vated NTA-agarose matrix sufficient to generate coluinns of 
about 75 ml packed bed volume. Each fusion protein was then 
subjected to the iterative refolding procedure as described 
for plasminogen kringle 4 in the Th0gersen et aL patent 
5 application (WO 94/18227) , except that refolding of the scFv 
containing fusion proteins was carried out at lO^'C using a 
buffer containing 0.5 M NaCl, 50 mM Tris-HCl pH 8, 2 mM 
glutathione and 0.2 mM oxidized glutathione as "buffer A" and 
a buffer containing 8 M urea, T M NaCl, 50 mM Tris-HCl pH 8 
10 and 2 mM glutathione as "buffer B" . 

After completion of the refolding procedure each column was 
washed with 300 ml buffer containing 0 . 5 M NaCl and 50 mM 
Tris-HCl pH 8 to wash away glutathione. The refolded fraction 
of each protein was then eluted from the NTA-agarose matrix 
15 by addition of 20 mM EDTA to the elution buffer. After addi- 
tion of solid urea to achieve a final concentration of about 
8 M to each protein sample and dilution or dialysis to reduce 
NaCl concentrations to below 5 mM, final purification of each 
correctly folded fusion protein product was then accomplished 
20 by ion exchange chromatography (S-Sepharose, Pharmacia, 1,6 
(i.d.) by 90 centimeter column in a buffer containing 8 M 
urea, 5 mM Tris-HCl (froml M stock solution at pH 8) and 25 
mM sodium acetate (from i M stock solution at pH 5) , eluted 
at 2 ml/min) . After dialysis against aqueous buffers (e.g. 
25 phosphate buffered saline) each pure and correctly refolded 
fusion protein was recovered in yields of 2-6 mg per litre of 
culture grown. Each protein may be shown by analytical gel 
filtration, chemical cross-linking analysis, by in vitro 
affinity measurements and by in vivo efficacy to form a 
30 stable homotrimeric molecular complex: The oligomeric status 
of the H6FXtripb-scFv- (CEA6) fusion protein was analyzed by 
chemical cross-linking analysis with DMSI: In parallel ex- 
periments, samples of H6PXtripb-scFv- (CEA6) at 0.34n)g/ml and 
TN12 3 at 0.28 mg/ml in 0.1 M Sodium borate were incubated at 
35 room temperature with increasing amounts (0 - 40 /xg in 12 ^1 
in total) of DMSI for 3 0 min. Reactions were quenched by 
addition of 5 ^1 3M Tris-HCl pH 9 and the samples analyzed by 
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SDS-PAGE under reducing conditions (Fig. 21) . Like tetranec- 
tin, the H6FXtripb-scPV- (CEA6) .fusion protein, of approxi- 
mately 38 kDa, is hereby shown to be a trimer in solution. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION 

(i) APPLICANT 

(A) NAME: Thragersen, Hans Christian 

(B) STREET: Ristrupvej 41 

(C) CITY: Mundelstrup 

(D) STATE OR PROVINCE: 

(E) COUNTRY: Denmark 

(F) POSTAL CODE: 8381 

(ii) TITLE OF THE INVENTION: Trimerising module 



(iii) NUMBER OF SEQUENCES: 3 4 

(iv) COMPUTER-READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(v) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

(2) INFORMATION FOR SEQ ID N0:1: 

( i ) SEQUENCE 'CHARACTERISTICS : 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ IDN0:1; 
CCTGATCAAT CCAGGGAAGA TCTCCTGGTA CCGAGCCACC AACCCAG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
^ CCAAGCTTAT TAGGATCCCG TCTGCAGGGC CTG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCGAAGCTTA TTAGGATCCC TTCAGGGAGA CCGTCTGCAG . 40 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Gly Ser lie Gin Gly Arg 
T 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Pro Pro Thr Gin Lys Pro Lys Lys lie Val Asn Ala Lys Lys Asp 

T 5 10 15 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

20 25 30 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala -Leu Gin Thr 

35 40 45 

Val Ser Leu Lys 
50 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 
T 5 10 15 
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Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

20 25 30 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 
35 40 45 

Val 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

lie Val Asn Ala Lys Lys Asp 
10 15 
Leu Lys Ser Arg Leu Asp Thr 
30 

Glu Gin Gin Ala Leu Gin Thr 
45 

Met Lys Cys Phe Leu Ala Phe 
60 

Ser Glu Asp Cys He Ser Arg 

75 80 
Gly Ser Glu Asn Asp Ala Leu 
90 95 
Asn Glu Ala Glu He Trp Leu 
110 

Thr Trp Val Asp Met Thr Gly 
125 

Thr Glu He Thr Ala Gin Pro 
140 

Val Leu Ser Gly Ala Ala Asn 
155 160 
Asp Gin Leu Pro Tyr He Cys 
170 175 

Val 
180 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Glu Pro Pro Thr 
1 


Gin Lys 

5 


Pro 


Lys Lys 


Val Val Asn Thr 


Lys Met 


Phe 


Glu Glu 


20 






25 


Leu Ala Gin Glu 


Val Ala 


Leu 


Leu Lys 


35 






40 


Val Cys Leu Lys 


Gly Thr 


Lys 


Val His 


50 




55 




Thr Gin Thr Lys 


Thr Phe 


His 


Glu Ala 


65 


70 






Gly Gly Thr Leu 


Ser Thr 


Pro 


Gin Thr 




85 






Tyr Glu Tyr Leu 


Arg Gin 


Ser 


Val Gly 


100 






105 


Gly Leu Asn Asp 


Met Ala 


Ala 


Glu Gly 


115 






120 


Ala Arg He Ala 


Tyr Lys 


Asn 


Trp Glu 


13 0 




135 




Asp Gly Gly Lys 


Thr Glu 


Asn 


Cys Ala 


145 


150 






Gly Lys Trp Phe 


Asp Lys 


Arg 


Cys Arg 




165 






Gin Phe Gly He 


Val 







(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCTGGATCCA TCGAGGGTAG GGGCGAGCCA CCAACCCAG 
(2) INFORMATION FOR SEQ ID NO: 9: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCGAAGCTTA CACGATCCCG AACTG 25 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ■ 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:10: 

Gly Ser lie Glu Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

.Met Val Arg Ala Asn Lys Arg Asn Glu Ala Leu Arg lie Glu Ser Ala 

1 5 10 15 

Leu Leu Asn Lys lie Ala Met Leu Gly Thr Glu Lys Thr Ala Glu Gly 
20 25 ■ 30 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gly Ser His His His His His His Gly Ser 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) .TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CCGAAGCTTA GACCGTCTGC AGGGC 

(2) INFORMATION FOR SEQ ID NO: 14: 



■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGCGGATCCA TCCAGGGTAG GGTTGTVSAAC ACAAAGATG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCTGGATCCA TCGAGGGTAG GGCCCTGCAG ACGGTC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



ATGCAGATCT TTGTGAAGAC CCTCACTGGC AAAACCATCA CCCTTGAGGT CGAGCCCAGT . 60 

GACACCATTG AGAATGTCAA AGCCAAAATT CAAGACAAGG AGGGTATCCC ACCTGACCGC " 120 

AGCGTCTGAT ATTTGCCGGC AAACAGCTGG AAGATGGACG TACTTTGTCT GACTACAATA 18 0 

TTCAAAAGGA GTCTACTCTT CATCTTGTGT TGAGACTTCG TGGTGGT 227 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGCTGATCAC AGATCTTTGT GAAGACC 27 
(2) liSTFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CGCAAGCTTG CATGCTTAGG ATCCACCACG AAGTCTCAA 3 9 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Gin He Phe Val' Lys Thr Leu Thr Gly Lys Thr He Thr Leu Glu 

15 10 15 

Val Glu Pro Ser Asp Thr He Glu Asn Val Lys Ala Lys He Gin Asp 

20 25 30 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys 

35 40 45 

Gin Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn He Gin Lys Glu 

50 55 60 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



CAGGTTCAGC TGCAGCAGTC AGGGGCTGAG GTGAAGAAGC CTGGGTCCTC GGTGAAGGTC 



60 
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TCCTGCAAGG CTTCTGGAGG CACCTTCAGC AACTCTCCTA TCAACTGGCT GCGACAGGCC 
CCCGGACAAG GGCTTGAGTG GATGGGAAGT ATCATCCCTT CCTTTGGTAC AGCAAACTAC 
GCTCAGAAGT TCCAGGGCAG ACTCACGATT ACCGCGGACG AATCCACGAG CACAGCCTAC 
ATGGAGCTGA GCAGCCTGAG ATCTGAGGAC ACGGCCGTGT ATTACTGTGC GGGGCGGAGC 
CACAACTACG AACTCTACTA TTACTACATG GACGTCTGGG GCCAGGGGAC AATGGTCACC 
GTCTCGAGTG GTGGAGGCGG TTCAGGCGGA GGTGGCAGCG GCGGTGGCGG ATCGGACATC 
CAGATGACCC AGTCTCCTTC CACCCTGTCT GCATCTATTG GAGACAGAGT CACCATCACC 
TGCCGGGCCA GTGAGGGTAT TTATCACTGG TTGGCCTGGT ATCAGCAGAA GCCAGGGAAA 
GCCCCTAAAC TCCTGATCTA TAAGGCCTCT AGTTTAGCCA GTGGGGCCCC ATCAAGGTTC 
AGCGGCAGTG GATCTGGGAC AGATTTCACT CTCACCATCA GCAGCCTGCA GCCTGATGAT 
TTTGCAACTT ATTACTGCCA ACAATATAGT AATTATCCGC TCACTTTCGG CGGAGGGACC 
AAGCTGGAGA TCAAACGTGC GGCCGCAGAA CAAAAACTCA TCTCAGAAGA GGATCTGAAT 
GGGGCC 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGTGGATCCC AGGTTCAGCT GCAGC 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCCGGTACCG GCCCCATTCA GATCC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:23: 
TCCAAGCTTA GGCCCCATTC AGATCC 

(2) INFORMATION FOR SEQ ID NO: 24: 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
786 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:24: 



Met 


Val 


Arg Ala 


Asn 


Lys 


Arg 


Asn' 


Glu Ala Leu Arg He Glu Ser 


Ala 


1 






5 








10 15 




Leu 


Leu 


Asn Lys 


He 


Ala 


Met 


Leu Gly Thr Glu Lys Thr Ala Glu 


Gly 






20 










25 30 




Glv 


Ser 


His His 


His 


His 


His 


His 


Gly Ser He Glu Gly Arg Gly 


Glu 






35 








*± \J 


45 




Pro 


Pro 


Thr Gin 


Lvs 


Pro 


Lvs 


Lvs 


He Val Asn Ala Lys Lys Asp 


Val 




50 








55 




D u 




Vat 


Asn 




Met 


Phe 


Glu 


Glu 


Leu Lvs Ser Ara Leu Asp Thr 


Leu 


65 








70 








80 


Ala 


Gin 


Glu Val 


Ala 


Leu 


Leu 


Lys 


Glu Gin Gin Ala Leu Gin Thr 


Val 








85 












Cys 


Leu 


Lys Gly 


Thr 


Lys 


Val 


His 


Met Lys Cys Phe Leu Ala Phe 


Thr 






100 














Gin 


Thr 


Lys Thr 


Phe 


His 


Glu 


Ala 


Ser Glu Asp Cys He Ser Arg 


Gly 






115 








12 0 


125 






TViT* 
±IIjL 


T.011 Got* 


Thr 


Pro 


Gin 


Thr 


Gly Ser Glu Asn Asp Ala Leu 


Tvr 




13 0 








135 




140 




Glu 


Tyr 


Leu Arg 


Gin 


Ser 


Val 


Gly 


Asn ^ Glu Ala Glu He Trp Leu 


Gly 


145 








150 






155 


160 


Leu 


Asn 


Asp Met 


Ala 


Ala 


Glu 


Gly 


Thr Trp Val Asp Met Thr. Gly 


Ala 








165 








170 175 




Arg 


He 


Ala Tyr 


Lys 


Asn 


Trp 


Glu 


Thr Glu He Thr Ala Gin Pro 


Asp 






180 










185 190 




Gly 


Gly 


Lys Thr 


Glu 


Asn 


Cys 


Ala 


Val Leu Ser Gly Ala Ala Asn 


Gly 






195 








200 


205 




Lys 


Trp 


Phe Asp 


Lys 


Arg 


Cys 


Arg 


Asp Gin Leu Pro Tyr He Cys 


Gin 




210 








215 




220 




Phe 


Gly 


He Val 














225 



















(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Gly Ser His His His His His His Gly Ser He Glu Gly Arg Gly 

1 5 10 . 15 

Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 

20 25 30 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

35 40 45 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 
50 55 60 
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Val Cys Leu Lys Gly Thr Lys Val His Mel Lys Cys Phe Leu Ala Phe 
65 70 75 80 

Thr Gin Thr Lys Thr Phe His Glu Ala Ser Glu Asp Cys lie Ser Arg 

85 90 95 

Gly Gly Thr Leu Ser Thr Pro Gin Thr Gly Ser Glu Asn Asp Ala Leu 

100 105 110 

Tyr Glu Tyr Leu Arg Gin Ser Val Gly Asn Glu Ala Glu He Trp Leu 

115 120 125 

Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val Asp Met Thr Gly 

130 135 140 

Ala Arg He Ala Tyr Lys Asn Trp Glu Thr Glu He Thr Ala Gin Pro 
145 150 155 160 

Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser Gly Ala Ala Asn 

165 170 175 

Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gin Leu Pro Tyr He Cys 

180 185 190 

Gin Phe Gly He Val 
195 

(2) INFORMATION FOR SEQ ID NO: 26 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:26: 

Met Gly Ser His His His His His His Gly Ser He Glu Gly Arg Gly 

T 5 10 15 

Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 

20 25 30 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

35 40 45 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 
50 55 6 0 

Val 
65 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Gly Ser His His His His His His Gly Ser He Gin GlyArgVal 

T 5 10 15 

Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 
20 25 30 
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Ala Gin 


Glu 


Val 


Ala 


Leu 


Leu 


Lvs 


Glu 


Gin 


Gin 


Ala Leu Gin Thr Val 




35 










40 








45 


Cys Leu 


Lys 


Gly 


Thr 


Lys 


Val 


His 


Met 


Lys 


Cys 


Phe Leu Ala Phe Thr 






















60 


Gin Thr 


Lvs 


Thr 


Phe 


His 


Glu 


Ala 


Ser 


Glu 


Asp 


Cys He Ser Arg Gly 


o o 








70 










75 


80 


Gly, Thr 


Leu 


Ser 


Thr 


Pro 


Gin 


Thr 


Gly 


Ser 


Glu 


Asn Asp Ala Leu Tyr 








ft ^ 














95 


Glu Tyr 


Leu 


Ara 


Gin 


Ser 


Val 


Gly 


Asn 


Glu 


Ala 


Glu He Trp Leu Gly 






100 










105 






110 


Leu Asn 


Asp 


Met 


Ala 


Ala 


Glu 


Gly 


Thr 


Trp 


Val 


Asp Met Thr Gly Ala 




X X .7 










120 








125 


Arg lie 


Ala 


Tvr 

A jr J. 


Lys 


Asn 


Tro 


Glu 


Thr 


Glu 


He 


Thr Ala Gin Pro Asp 


130 










135 










140 


Gly Gly 


Lys 


Thr 


Glu 


Asn 


Cys 


Ala 


Val 


Leu 


Ser 


Gly Ala Ala Asn Gly 


145 








150 










155 


160 


Lys Trp 


Phe 


Asp 


Lys 


Arg 


Cys 


Arg 


Asp 


Gin 


Leu 


Pro Tyr He Cys Gin 








165 










170 




175 


Phe Gly 


He 


Val 



















180 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) . STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Met Gly 


Ser His 


His 


His His His His Gly Ser He Gin Gly Arg Ser 






5 




10 15 


Pro Gly 


Thr Glu 


Pro 


Pro Thr Gin Lys 


Pro Lys Lys He Val Asn Ala 




20 




25 


30 


Lys Lys 


Asp Val 


Val 


Asn Thr Lys Met 


Phe Glu Glu Leu Lys Ser Arg 




35 




40 


45 


Leu Asp 


Thr Leu 


Ala 


Gin Glu Val Ala 


Leu Leu Lys Glu Gin Gin Ala 


50 






55 


60 


Leu. Gin 


Thr Val 


Ser 


Leu Lys Gly Ser 




65 






70 





(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Ser 
T 5 10 15 
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Pro Gly Thr Glu Pro Pro Thr Gin Lys 

20 25 
Lys Lys Asp Val Val Asn Thr Lys Met 

35 40 
Leu Asp Thr Leu Ala Gin Glu Val Ala 

50 55 
Leu Gin Thr Gly Ser 
65 
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Pro Lys Lys lie Val Asn Ala 
30 

Phe Glu Glu Leu Lys Ser Arg 
45 

Leu Leu Lys Glu Gin Gin Ala 
60 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Met Gly Ser His His His His His His Gly Ser He Glu Gly Arg Ala 

15 10 15 

Leu Gin Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys Phe 

20 25 30 

Leu Ala Phe Thr Gin Thr Lys Thr Phe His Glu Ala Ser Glu Asp Cys 

35 40 45 

He Ser Arg Gly Gly Thr Leu Ser Thr Pro Gin Thr Gly Ser Glu Asn 

50 55 60 

Asp Ala Leu Tyr Glu Tyr Leu Arg Gin Ser Val Gly Asn Glu Ala Glu 

65 70 75 80 

He Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val Asp 

85 90 95 

Met Thr Gly Ala Arg He Ala Tyr Lys Asn Trp Glu Thr Glu He Thr 

100 105 110 

Ala Gin Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser Gly 

115 120 125 

Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gin Leu Pro 

130 135 140 

Tyr lie Cys Gin Phe Gly He Val 
145 150 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Ser 

1 5 10 15 

Pro Gly Thr Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala 
20 25 30 



wo 98/56906 PCT/DK98/00245 

69 



Lys 


Lys 


Asp 


Val 


Val Asn 


Thr 


Lys 


Met 


Phe 


Glu 


Glu Leu Lys Ser Arg 






35 








40 








45 


Leu Asp 


Thr 


Leu 


Ala Gin 


Glu 


Val 


Ala 


Leu 


Leu 


Lys Glu Gin Gin Ala 




50 








55 










60 


Leu 


Gin 


Thr 


Gly 


Ser Gin 


He 


Phe 


Val 


Lys 


Thr 


Leu Thr Gly Lys Thr 


65 








70 










75 


80 


He 


Thr 


Leu 


Glu 


Val Glu 


Pro 


Ser 


Asp 


Thr 


He 


Glu Asn Val Lys Ala 










85 








90 




95 


Lys 


lie 


Gin 


Asp 


Lys Glu 


Gly 


lie 


Pro 


Pro 


Asp 


Gin Gin Arg Leu He 








100 








105 






110 


Phe 


Ala 


Gly 


Lys 


Gin Leu 


Glu 


Asp 


Gly 


Arg 


Thr 


Leu Ser Asp Tyr Asn 






1 1 5 








120 








1 25 


He 


Gin 


Lys 


Glu 


Ser Thr 


Leu 


His 


Leu 


Val 


Leu 


Arg Leu Arg Gly Gly 




130 








135 










140 



Ser 



1 45 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Gly Ser His His His His His His Gly Ser lie Gin Gly Arg Ser 

15 10 15 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ser 

20 25 30 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Ser Asn Ser 

35 ' 40 45 

Pro He Asn Trp Leu Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp Met 

50 55 60 

Gly Ser He He Pro Ser Phe Gly Thr Ala Asn Tyr Ala Gin Lys Phe 
65 70 75 - 80 

Gin Gly Arg Leu Thr He Thr Ala Asp Glu Ser Thr Ser Thr Ala Tyr 

85 90 95 

M^et Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 

100 105 110 

Ala Gly Arg Ser His Asn Tyr Glu Leu Tyr Tyr Tyr Tyr Met Asp Val 

115 120 125 

Trp Gly Gin Gly Thr Met Val Thr Val Ser Ser Gly Gly Gly Gly Ser 

130 135 140 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp He Gin Met Thr Gin 
145 150 155 160 

Ser Pro Ser Thr Leu Ser Ala Ser He Gly Asp Arg Val Thr He Thr 

165 170 175 

Cys Arg Ala Ser Glu Gly He Tyr His Trp Leu Ala Trp Tyr Gin Gin 

180 185 190 

Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Lys Ala Ser Ser Leu 

195 200 ' 205 

Ala Ser Gly Ala Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
210 215 220 
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Phe 


Thr 


Leu Thr 


He 


Ser 


Ser 


Leu 


Gin 


Pro 


Asp Asp 


Phe Ala Thr Tyr 


225 








230 










235 


240 


Tyr 


cys 


Gin Gin 


Tyr 


Ser 


Asn 


Tyr 


Pro 


Leu 


Thr Phe 


Gly Gly Gly Thr 








245 










250 




255 


Lys 


Leu 


Glu He 


Lys 


Arg 


Ala 


Ala 


Ala 


Glu 


Gin Lys 


Leu He Ser Glu 






260 










265 






2 7 0 


Glu 


Asp 


Leu Asn 


Gly 


Ala 


Gly 


Thr 


Glu 


Pro 


Pro Thr 


Gin Lys Pro Lys 






2 75 








280 








2 85 


Lys 


He 


Val Asn 


Ala 


Lys 


Lys 


Asp 


Val 


Val 


Asn Thr 


Lys Met Phe Glu 




290 








2 95 








300 




Glu 


Leu 


Lys Ser 


Arg 


Leu 


Asp 


Thr 


Leu 


Ala 


Gin Glu 


Val Ala Leu Leu 


305 








310 










315 


320 


Lys 


Glu 


Gin Gin 


Ala 


Leu 


Gin 


Thr 


Gly 


Ser 












325 










330 







(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



Met 


Gly 


Ser 


His 


His 


His 


His 


His 


His 


Gly 


Ser 


He 


Gin Gly Arg Ser 


1 








5 










10 






15 


Pro 


Gly 


Thr 


Glu 


Pro 


Pro 


Thr 


Gin 


Lys 


Pro 


Lys 


Lys 


He Val Asn Ala 








20 










25 








30 


Lys 


Lys 


Asp 


Val 


Val 


Asn 


Thr 


Lys 


Met 


Phe 


Glu 


Glu 


Leu Lys Ser Arg 






35 










40 










45 


Leu 


Asp 


Thr 


Leu 


Ala 


Gin 


Glu 


Val 


Ala 


Leu 


Leu 


Lys 


Glu Gin -Gin Ala 




50 










55 










60 




Leu 


Gin 


Thr 


Gly 


Ser 


Gin 


Val 


Gin 


Leu 


Gin 


Gin 


Ser 


Gly Ala Glu Val 


65 










70 










75 




80 


Lys 


Lys 


Pro 


Gly 


Ser 


Ser 


Val 


Lys 


Val 


Ser 


Cys 


Lys 


Ala Ser -Gly Gly 










85 










90 






95 


Thr 


Phe 


Ser 


Asn 


Ser 


Pro 


He 


Asn 


Trp 


Leu 


Arg 


Gin 


Ala Pro Gly Gin 








100 










105 








110 


Gly 


Leu 


Glu 


Trp 


Met 


Gly 


Ser 


He 


He 


Pro 


Ser 


Phe 


Gly Thr Ala Asn 






115 










120 










125 


Tyr 


Ala 


Gin 


Lys 


Phe 


Gin 


Gly 


Arg 


Leu 


Thr 


He 


Thr 


Ala Asp Glu Ser 




13 0 










135 










140 




Thr 


Ser 


Thr 


Ala 


Tyr 


Met 


Glu 


Leu 


Ser 


Ser 


Leu 


Arg 


Ser Glu Asp Thr 


145 










150 










155 




160 


Ala 


Val 


Tyr 


Tyr 


Cys 


Ala 


Gly 


Arg 


Ser 


His 


Asn 


Tyr 


Glu Leu Tyr Tyr 










165 










170 






175 


Tyr 


Tyr 


Met 


Asp 


Val 


Trp 


Gly 


Gin 


Gly 


Thr 


Met 


Val 


Thr Val Ser Ser 








180 










185 








190 


Gly 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Gly Gly Ser Asp 






195 










200 










205 


He 


Gin 


Met 


Thr 


Gin 


Ser 


Pro 


Ser 


Thr 


Leu 


Ser 


Ala 


Ser He Gly Asp 




210 










215 










220 




Arg 


Val 


Thr 


He 


Thr 


Cys 


Arg 


Ala 


Ser 


Glu 


Gly 


He 


Tyr His Trp Leu 


225 










230 










235 




240 
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Ala Trp 


Tyr 


Gin Gin Lys 


Pro Gly 


Lys 






245 






Lys Ala 


Ser 


Ser Leu Ala 


Ser Gly 


Ala 






260 




265 


Gly Ser 


Gly 


Thr Asp Phe 


Thr Leu 


Thr 




275 




280 




Asp Phe 


Ala 


Thr Tyr Tyr 


Cys Gin 


Gin 


290 






295 




Phe Gly 


Gly 


Gly Thr Lys 


Leu Glu 


lie 


305 




310 






Lys Leu 


He 


Ser Glu Glu 


Asp Leu 


Asn 



325 



(2) INFORMATION FOR SEQ ID 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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7 


1 






Ala 


Pro 


Lys Leu Leu 


lie Tyr 


250 






255 


Pro 


Ser 


Arg Phe Ser Gly Ser 






270 




He 


Ser 


Ser Leu Gin 


Pro Asp 






285 




Tyr 


Ser 


Asn Tyr Pro 


Leu Thr 






300 




Lys 


Arg 


Ala Ala Ala 


Glu Gin 




315 




320 


Gly 


Ala 






330 









NO: 34: 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO;34: 



Met Gly Ser 


His 


His His His His His 


Gly 


Ser 


He Gin Gly Arg Ser 


1 




5 


10 




15 


Gin Val Gin 


Leu 


Gin Gin Ser Gly Ala 


Glu 


Val 


Lys Lys Pro Gly Ser 




20 


25 






30 


Ser Val Lys 


Val 


Ser Cys Lys Ala Ser 


Gly 


Gly 


Thr Phe Ser Asn Ser 


• 35 




40 






45 


Pro lie Asn 


Trp 


Leu Arg Gin Ala Pro 


Gly 


Gin 


Gly Leu Glu Trp Met 


50 




55 






60 


Gly Ser lie 


He 


Pro Ser Phe Gly Thr 


Ala 


Asn 


Tyr Ala Gin Lys Phe 


65 




70 




75 


80 


Gin Gly Arg 


Leu 


Thr He Thr Ala Asp 


Glu 


Ser 


Thr Ser Thr Ala Tyr 






85 


90 




95 • 


Met Glu Leu 


Ser 


Ser Leu Arg Ser Glu 


Asp 


Thr 


Ala Val Tyr Tyr Cys 




ioo 


105 






110 


Ala Gly Arg 


Ser 


His Asn Tyr Glu Leu 


Tyr 


Tyr 


Tyr Tyr Met Asp Val 


115 




120 






125 


Trp Gly Gin 


Gly 


Thr Met Val Thr Val 


Ser 


Ser 


Gly Gly Gly Gly Ser 


13 0 




135 






140 


Gly Gly Gly 


Gly 


Ser Gly Gly Gly Gly 


Ser 


Asp 


lie Gin Met Thr Gin 


145 




150 




155 


160 


Ser Pro Ser 


Thr 


Leu Ser Ala Ser He 


Gly 


Asp 


Arg Val Thr He Thr 






165 


170 




175 


Cys Arg Ala 


Ser 


Glu Gly He Tyr His 


Trp 


Leu 


Ala Trp Tyr Gin Gin 




180 


185 






190 


Lys Pro G 1 y 


Lys 


Ala Pro Lys Leu Leu 


lie 


Tyr 


Lys Ala Ser Ser Leu 


195 




200 






205 


Ala Ser Gly 


Ala 


Pro Ser Arg Phe Ser 


Gly 


Ser 


Gly Ser Gly Thr Asp 


210 




215 






220 


Phe Thr Leu 


Thr 


He Ser Ser Leu Gin 


Pro 


Asp 


Asp Phe Ala Thr Tyr 


.225 




230 




235 


240 


Tyr Cys Gin 


Gin 


Tyr Ser Asn Tyr Pro 


Leu 


Thr 


Phe Gly Gly Gly Thr 






245 


250 




255 
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Lys Leu 


Glu 


lie 


Lys 


Arg 


Ala 


Ala 


Ala 






260 










265 


Glu Asp 


Leu 


Asn 


Gly 


Ala 


Gly 


Thr 


Glu 




275 










280 




Lys He 


Val 


Asn 


Ala 


Lys 


Lys 


Asp 


Val 


290 










295 






Glu Leu 


Lys 


Ser 


Arg 


Leu 


Asp 


Thr 


Leu 


305 








3T 0 








Lys Glu 


Gin 


Gin 


Ala 


Leu 


Gin 


Thr 


Gly 








325 








Ser Gly 


Ala 


Glu 


Val 


Lys 


Lys 


Pro 


Gly 






340 










345 


Lys Ala 


Ser 


Gly 


Gly 


Thr 


Phe 


Ser 


Asn 




355 










360 




Gin Ala 


Pro 


Gly 


Gin 


Gly 


Leu 


Glu 


Trp 


370 










375 






Phe Gly 


Thr 


Ala 


Asn 


Tyr 


Ala 


Gin 


Lys 


385 








3 90 






Thr Ala 


Asp 


Glu 


Ser 


Thr 


Ser 


Thr 


Ala 








405 










Arg Ser 


Glu 


Asp 


Thr 


Ala 


Val 


T V r 


Tvr 






420 










425 


Tyr Glu 


Leu 


Tyr 


Tyr 


Tyr 


Tyr" 


M e t 


Asp 




435 










44 0 




Val Thr 


Val 


Ser 


Ser 


Gly 


Gly 


Gly 


Gly 


450 










455 






Gly Gly 


Gly 


Ser 


Asp 


He 


Gin 


M e t 


Thr 


465 








470 








Ala Ser 


lie 


Gly 


Asp 


Are 

o 


Val 


Thr 


He 








485 










lie Tyr 


His 


Trp 


Leu 


Ala 


Trp 


Tyr 


Gin 






500 










505 


Lys Leu 


Leu 


He 


Tyr 


Lys 


Ala 


Ser 


Ser 




515 










520 




Arg Phe 


Ser 


Gly 


Ser 


Gly 


Ser 


Gly 


Thr 


530 










535 






Ser Leu 


Gin 


Pro 


Asp 


Asp 


Phe 


Ala 


Thr 


545 








550 








Asn Tyr 


Pro 


Leu 


Thr 


Phe 


Gly 


Gly 


Gly 








565 










Ala Ala 


Ala 


Glu 


Gin 


Lys 


Leu 


He 


Ser 






580 










585 
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Glu 


Gin Lvs 


Jpii ^pr CI 11 
LrfCu xxc oci vjiu 








Pro 


Pro Thr 


nin luc Prn I\7c 






£.OiJ 


Val 


Asn Thr 


Lvs Met Php Gill 




300 




Ala 


Gin Glu 


Val Ala Leu Leu 




315 




Ser 


Gin Val 


Gin T P11 Gin n 1 n 

vJ i 11 l_iCU vJ 1 11 VT 1 11 


330 




7 7 


Ser 


Ser Val 


l Vfll ^pr Pvc 








Ser 


Pro He 


A ? n "yrn Ton A r n 
r\oll -lip LcU rtrg 






t^ 

ODD 


Met 


Gly Ser 


Hp Hp Prn Spr 

xx«^ ilU OCX 




o ou 




Phe 


Gin Gly 


Arg I p 11 Thr Tip 

1 g L( C U J. IJl X 1 C 




Q c; 

J y o 


400 


Tvr 


Met Glu 


I p II Spr ^pr I All 

XjCU OCl OCl l^CU 


4 1 n 




415 


Cys 


A 1 n 1 V 


rtrg oer nis Asn 








Val 


Trp Gly 


Gin n 1 V Thr \yf p t 
vjiii \j I y 1111 Mel 






>l /I c 


Ser 


G 1 V G 1 V 


Clv Glv Spr filv 

vJlj vjiy OCl \jr I y 




D U 




Gin 


Ser Pro 


Spr Thr T on Qor 

oci 1 111 j_( c u oer 




H f D 


480 


Thr 


^y5> ^rg 


A 1 a Qo-r P 1 II P 1 If 

/\ 1 a oer LtIU uiy 


4 on 
y u 




4 95 


Gin 


Lys Pro 


niv Tvc Ala Prn 

oiy t-ya /\ia. rro 








Leu 


Ala S pr 


Glv Ala Prn Spf 






1? 0 C 


Asp 


Phe Thr 


Leu Thr He Ser 




540 




Tyr 


Tyr Cys 


Gin Gin Tyr Ser 




555 


560 


Thr 


Lys Leu 


Glu He Lys Arg 


570 




575 


Glu 


Glu Asp 


Leu Asn Gly Ala 



590 
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• CLAIMS 

1. A monomer polypeptide construct comprising at least one 
tetranectin trimerising structural element (TTSE) which is 
covalently linked to at least one heterologous moiety, said 
5 TTSE being capable of forming a stable complex with two other 
TTSEs, with the proviso that the heterologous moiety is 
different from any of the fusion proteins CIIH6FXTN123 , 
H6FXTN123, H6FXTN12, H6FXTN23, the sequences of which are 
shown in SEQ ID NOs : 24-2 7. 

10 2 . A monomer polypeptide construct comprising at least one 
tetranectin trimerising structural element (TTSE) which is 
covalently linked to at least one heterologous moiety, said 
TTSE being capable of forming a stable complex with two other 
TTSEs, said at least one heterologous moiety being one which 

15 does not exclusively facilitate expression and/or purif ica- . 
tion of the monomer polypeptide construct. 

3 . A monomer polypeptide construct according to claim l or 2 , 
wherein the stable complex includes a triple alpha helical 
coiled coil. 

20 4 - A monomer polypeptide construct according to any of the 

preceding claims, wherein the heterologous moiety is selected 
from the group consisting of a ligand binding structure; a 
toxin; a detectable label; an in situ activatable substance; 
an enzyme; a radioactive moiety; a cytokine; a non-protein- 

25 aceous polymer such as a polymeric alkaloid, a polyalcohol, a 
polysaccharide, a lipid and a polyamine; a photo cross -lin- 
king agent; and a group facilitating conjugation of the 
monomer polypeptide construct to a target. 

5 . A monomer polypeptide construct according to any of the 
30 preceding claims, which comprises 2 TTSEs which are covalent- 
ly linked by a spacer moiety which allows both of the 2 TTSEs 
to take part in complex formation with a third TTSE not being 
part of the monomer polypeptide construct. 
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6 . A monomer polypeptide construct according to claim 5 , 
wherein the spacer moiety has a length and a conformation 
which favours complex formation involving both of the two 
TTSEs which are covalently linked by the spacer moiety. 

5 7. A monomer polypeptide construct according to claim 5 or 6 , 
wherein the spacer moiety is a polypeptide fragment. 

8. A monomer polypeptide construct according to any of claims 
1-4, which comprises one single TTSE. 

9 . A monomer polypeptide construct according to any of the 
10 preceding claims, wherein the TTSE is derived from human 

tetranectin, murine tetranectin, C-type lectin of bovine 
cartilage, or C-type lectin of shark cartilage. 

10. A monomer polypeptide construct according to claim 9, 
wherein the TTSE comprises a polypeptide sequence which has 
at least 58% sequence identity with the consensus sequence 
shown in Fig . 2 . 



15 



11. A monomer polypeptide construct according to claim lo, 
wherein the sequence identity with the consensus sequence is 
at least 75%, such as at least 81%, at least 87%, or at least 

20 92%. 

12. A monomer polypeptide construct according to any of 
claims 9-11, wherein at least one amino acid residue selected 
from the group consisting of amino acid residue nos. 6, 21, 
22. 24, 25, 27, 28. 31, 32, 35, 39, 41, 42, is/are substitu- 

25 ted by any non-helix breaking amino acid residue, the amino 
acid residue numbering referring to amino acid residues in 
SEQ ID NO: 7. 



30 



13. A monomer polypeptide construct according to any of the 
preceding claims, wherein the at least one TTSE comprises a 
repeated heptad having the formula a-b-c-d-e-f -g (N to C) , 
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wherein a majority of the amino acids residues a and d are 
hydrophobic amino acids . 



14. A monomer polypeptide construct according to claim 13, 
wherein heptad is repeated 3 times and wherein the last 

5 occurrence of the heptad has a glut amine residue correspon- 
ding to residues a and d. 

15. A monomer polypeptide construct according to any of the 
preceding claims, wherein the at least one heterologous 
moiety is covalently linked to the TTSE via a peptide bond to 

10 the N- or C-terminus of the TTSE peptide chain, via a peptide 
bond to a side chain in the TTSE, via a bond to a cysteine 
residue, or when more than one heterologous moity, 
combinations of these locations. 



16. A monomer polypeptide construct according to any of the 
15 preceding claims which lacks any free amino and/or carboxy 

groups . 

17 . A monomer polypeptide construct according to any of the 
preceding claims which lacks a substantial part of the N- 
terminal region of tetranectin which is encoded by exon i . 

20 18. A monomer polypeptide construct according to any of the 
preceding claims comprising two heterologous moieties which 
are linked via peptide bonds to the N- and C-terminus, re- 
spectively. 

19. A monomer polypeptide construct according to any of the 
2 5 preceding claims which is constructed so as to disfavour 
formation of complexes between identical TTSEs . 

2 0. An oligomer which is comprised of two monomer pol.ypeptide 
constructs according to any of claims 1-19, and which com- 
prises at three TTSE's or a multiplum of three TTSE's, or 
30 which is comprised of three monomer polypeptide constructs 
according to any of claims 1-4 or 8-19. 
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21. An oligomer according to claim 20 which is stable in the 
temperature range 50-70°C. 

22. An oligomer according to claim 20 or 21, which comprises 
at least one heterologous moiety which is positioned N-termi- 

5 nally to a TTSE and at least one heterologous moiety which is 
positioned C-terminally to a TTSE. 

23. An oligomer according to claim 22, wherein th6 at least 
one heterologous moiety which is positioned N-terminally to a 
TTSE and the at least one heterologous moiety which is posi- 

10 tioned C-terminally to a TTSE are part of the same monomeric 
polypep t ide cons t rue t . 



24. An oligomer according to claim 22, wherein the at least 
one heterologous moiety which is positioned N-terminally to a 
TTSE and the at least one heterologous moiety which is posi- 
15 tioned C-terminally to a TTSE are part of two separate mono- 
meric polypeptide constructs. 



25. An oligomer according to any of claims 20-24, wherein 
each monomer polypeptide construct is designed so as to 
disfavour formation of trimers including two monomer polypep- 

20 tide constructs having identical TTSEs . 

26. A method of preparing a monomer polypeptide construct 
according to any of claims 1-19, the method comprising 



isolating the monomer polypeptide construct from a cul- 
ture comprising a host cell which carries and expresses a 
25 nucleic acid fragment which encodes the monomer polypep- 

tide construct, or 



synthesizing, by means of chemical peptide synthesis, the 
monomer polypeptide construct and subsequently isolating 
the monomer polypeptide construct from the reaction 
30 mixture, or 
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preparing a TTSE in a culture comprising a host cell 
which carries and expresses a nucleic acid fragment which 
encodes the TTSE, subsequently linking covalently at 
least one heterologous moiety to the TTSE, and thereafter 
5 isolating the resulting monomer polypeptide construct, or 

synthesizing, by means of chemical peptide synthesis, a 
TTSE, subsequently linking covalently at least one hete- 
rologous moiety to the TTSE, and thereafter the isolating 
the resulting monomer polypeptide construct from the 
10 reaction mixture, 

and optionally subjecting the monomer polypeptide construct 
to further processing. 

27 . A method for preparing a dimeric oligomer according to 
claim 20 which comprises 

15 - admixing a monomer polypeptide construct according to any 
of claims 1-19 which includes two TTSEs (construct 1) 
with a monomer polypeptide construct according to any of 
claims 1-4 or 8-19 which includes only one TTSE (con- 
struct 2) , 

20 - effecting the two TTSE's of construct 1 to complex with 
the TTSE of construct 2 , and 

isolating the resulting dimer and optionally subjecting 
the dimer to further processing. 

28. A method for preparing a trimeric oligomer according to 
25 claim 20 which comprises 

admixing three monomer polypeptide constructs according 
to any of claims 1-19 with each other, 

effecting complex formation between one TTSE of each 
monomer polypeptide construct, and 
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isolating the resulting trimer and optionally subjecting 
the trimeric oligomer to further processing. 

29. A kit comprising 

a first package comprising at least one container means, 
5 each at least one container means containing a monomer 

polypeptide construct according to any of claims 1-19, 

a second package comprising at least one container means, 
each at least one container means in the second package 
containing a monomer polypeptide construct according to 
10 any of claims 1-19, the second package being different 

from the first package with respect to choice and/or 
number of monomer polypeptide constructs included there- 
in, and optionally 

a third package comprising at least one container means, 
15 each at least one container means in the third package 

containing a monomer polypeptide construct according to 
any of claims 1-19, the second package being different 
from the first and second packages with respect to choice 
and/or number of monomer polypeptide constructs included 
2 0 therein - 

3 0. A kit according to claim 29, wherein the at least one 
container means in each package contains mutually distinct 
monomer polypeptide constructs. 

31. A kit according to claim 29 or 3 0, wherein all container 
25 means comprised in the kit conprises mutually distinct poly- 
peptide constructs. 

32. A nucleic acid fragment in isolated form which encodes a 
TTSE as defined in any of claims 1-19 or which encodes the 
polypeptide part of a monomer polypeptide construct according 

30 to any of claims 1-19, with the proviso that the nucleic acid 
fragment is different from one that encodes native members of 



wo 98/56906 PCT/DK98/00245 

79 

the tetranectin family, and that the nucleic acid fragment is 
different from one that encodes any of the fusion proteins 
CIIH6FXTN123, H6FXTN123, H6FXTN12, H6FXTN23, the sequences of 
which are shown in SEQ ID NOs: 24-27. 

5 33. A replicable vector which comprises a nucleic acid frag- 
ment according to claim 32. 

34. A transformed host cell, which comprises a nucleic acid 
fragment according to claim 32 or a replicable vector accor- 
ding to claim 32 • 

10 35. Use of a monomer polypeptide construct according to any 
of claims i- 19 or to a an oligomer construct according to any 
of claims 20-25 for targeted gene therapy involving selective- 
delivery of the material for transf ection or infection of the 
specific population of cells. 

15 36. The use according to claim 35 wherein the at least one 
heterologous moity comprises a moiety selected from a ligand 
binding structure such as a receptor molecule or the ligand 
binding part of a receptor, molecule, and wherein the gene 
therapy involves the delivery of nucleic acids to the desired 

20 population of cells by use of a viral vector directed to 
cells displaying the artificial receptor complex 
corresponding to the heterologous moity. 

37. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a an oligomer -according to any of 

25 claims 20-25 as a component of a chimaeric product having low 
antigenicity in humans relative to formulations comprising on 
or. more components of non-human origin. 

38. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a an oligomer according to any of 

30 claims 20-25 as a vehicle for assembling antibody fragments 
into oligomeric or multivalent entities for generating 
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chimeric artificial antibodies having preselected 
pharmacokinetic and/or pharmadynamic properties. 



39. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a an oligomer according to any of 
5 claims 2 0-25 for delivering an imaging or toxin-conjugated 
antibody to a tumor. 



40. The use of a monomer polypeptide construct according .to 
any of claims 1-19 or to a oligomer according to any of 
claims 20-25 as a vehicle delivering an substance to a target 

10 cell or tissue. 

41. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 
claims 20-25 for a labelled construct wherein the label is 
coupled to one or to of the TTSE monomer units. 

15 42. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 
claims 20-25 for protein library technology, such as phage 
display technology. 



43. The use according to claim 42 comprising a poly 
20 nucleotide molecule encoding one or more TTSE. 

44 . The use according to claim 43 comprising a vector 
encoding one or more TTSE. 



45. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 

25 claims 20-25 for th^ preparation of a pharmacutical 
composition. 

46. The use according to any to claim 45, wherein the 
pharmaceutical composition further comprises a 
pharmaceutically acceptable excipient. 
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47. The use according to claim 45 or 46 wherein the 
pharmaceutical composition is administered by a route 
selected from the group consisting of the intraveneous route, 
the intraarterial route, the transmembraneus route of the 

5 buccal, anal, vaginal or conjunctival tissue, the intranasal 
route, the pulmonary route, the transdermal route, the 
intramuscular routed, subcutaneous route, intratechal route, 
inoculation into tissue such as a tumour, or by an implant. 

48. The use according to any of claims 35 to 47 

10 wherein the monomer polypeptide construct according to any of 
claims 1-19 or the oligomer according to any of claims 2 0-2 5 
■ is comprised in a liposome 

4 9. A method for treating or preventing of a disease 
comprising administering to the subject in need thereof an 
15 effective amount of a pharmceutical composition as defined in 
any of claims 4 5 and 4 6 . 

50. A method for treating or preventing a disease comprising 
administering to the subject in need thereof an effective 
amount of a relevant pharmceutical coupled to a monomer 

20 polypeptide construct according to any of claims 1-19 or to a 
oligomer according to any of claims 2 0-25. 

51. A method for targeted gene therapy comprising use of a 
monomer polypeptide construct according to any of claims 1-19 
or to a oligomer according to any of claims 20-25. 

25 52 . A method of human gene therapy comprising use of a 

monomer polypeptide construct according to any of claims 1-19 
or to an oligomer according to any of claims 20-25 wherein at 
least one TTSE is modifyed with a membrane integrating or 
associating entity having affinity to the specific population 

30 of cells in the body relevant for the gene therapy. 

53. A method according to any of claims 49 to 52 
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wherein the monomer polypeptide construct according to any of 
claims 1-19 or the oligomer according to any of claims 20-25 
is administered by a route selected from the group consisting 
of the intraveneous route, the intraarterial route, the 
5 transmembraneus route of the buccal, anal og vaginal tissue, 
intranasal route, the pulmonary route, the transdermal route, 
intramuscular, subcutaneous, intratechal, the buccal, 
inoculation into tissue such as a tumour, or by an implant. 

54. A method for prevention and/or treating a disease, 
10 conprising administering to a mammal in need thereof a 

prophylactically or therapeutically effective amount of a 
construct comprising the monomer polypeptide construct 
according to any of claims 1-19 or the oligomer according to 
any of claims 20-25. 



15 



55. A method for diagnosis comprising a construct comprising 
the monomer polypeptide construct according to any of claims 
1-19 or the oligomer according to any of claims 20-25 
together with a diagnosing component coupled thereon. 
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H6FXtripa fusion protein 

1 MGSHHHHHHGSIQGRSPGTEPPTQKPKKIV 30 

31 NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 

61EQQALQTVSLKGS* 73 

HGFXtripB fusion protein 

1 MGSHHHHHHGSIQGRSPGTEPPTQKPKKIV 30 

31 NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 

61EQQALQTGS* 69 



Fig. 4 
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TNI 23 



6SIEGRGEP IV* 
6ATCCATC6AG66TAG66GCGA6CCA ATCGTGTA 



M V R A- Cll -EGGSHHHHHH 
CATATGGTTCGTGCA GAAGGGGGATCGOVTCACaVTCACCATCACG 

Bam HI 



Phage T7 
promoter 



AGCTTGAATTC 

Hind III Eco H 




TN123 



GSIEGRGEP IV* 
GATCCATCGAGGGTAGGGGCGAGCCA ATCGTCTA 



G S H H 





CATATGGGATCGCATCACCATCACCATCACG AGCTTGAATTC 

Bam H Hind 111 Eco H 



Phage 17 
promoter 




Fig. 5 
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CIIH6FXTN123 fusion protein 



1 


MVRANKRNEAL 


R I E S A 


LLNKIAMLGTEKTA 


30 


31 


EGGSHHHHHHG 


S I E G R 


GEPPTQKPKKIVNA 


60 


61 


KKDVVNTKMFE 


E L K S R 


LDTLAQEVALLKEQ 


90 


91 


QALQTVCLKGT 


K V H M K 


CFLAFTQTKTPHEA 


120 


121 


SEDCISRGGTL 


S T P Q T 


GSENDALYEYLRQS 


150 


151 


VGNEAEIWLGL 


N D M A A 


EGTWVDMTGARIAY 


180 


181 


KNWETEITAQP 


D G G K T 


ENCAVLSGAANGKW 


210 


21 1 


FDKRCRDQLPY 


I C Q F G 


I V * 


228 


H6FXTN123 fusion protein 






1 


MGSHHHHHHGS 


I E G R G 


EPPTQKPKKIVNAK 


30 


31 


KDVVNTKMFEE 


L K S R L 


DTLAQEVALLKEQQ 


60 


6 1 


ALQTVCLKGTK 


VHMKCFLAFTQTKTFHEAS 


90 


91 


EDCISRGGTLS 


T P Q T G 


SENDALYEYLRQSV 


120 


121 


GNEAEIWLGLN 


D M A A E 


GTWVDMTGARIAYK 


150 


151 


NWETEITAQPD 


G G K T E 


NCAVLSGAANGKWF 


180 


181 


DKRCRDQIiPYI 


C Q F G I 


V * 


197 



Fig. 6 
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TM10 GSIEGRGEPP QTV* 
1 N I ^ GATCCATCGAGGGTAGGGGCGAGCCACCA CAGACGGTCTA 



TMQQ GSIQG RVVNT GIV* 
■ 6ATCCATCCAGGGTAGGGTTGTGAACACA GGGATCGTGTA 



GSIEGRA LQ GIV* 
■ GATCCATCGAGGGTAGGGCCCTGCAG GGGATCGTGTA 



MGSHHHHHH 

CATATGGGATCGCATCACCATCACCATCACG AGCTTGAATTC 

Bam HI Hind III Eco H 




Fig. 7 
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H6FXTN12 fusion protein 

1 MGSHHHHHHGSIEGRGEPPTQKPKKIVNAK 30 

31 KDVVNTKMFEELKSRLDTLAQEVALLKEQQ 60 

61 A L Q T V * 65 



H6FXTN23 fusion protein 

1 MGSHHHHHHGSIQGRVVNTKMFEELKS RL D 30 

31 TLAQEVALLKEQQALQTVCLKGTKVHMKCF 60 

61 LAFTQTKTFHEASEDCISRGGTLSTPQTGS 90 

91 ENDALYEYLRQSVGNEAEIWLGLNDMAAEG 120 

121 TWVDMTGARIAYKNWETEITAQPDGGKTEN 150 

151 CAVLSGAANGKWFDKRCRDQLPYICQFGIVISO 

181 * 

H6FXTN3 fusion protein 

1 MGSHHHHHHGSIEGRALQTVCLKGTKVHMK 30 

31 CFLAFTQTKTFHEASEDCISRGGTLSTPQT 60 

61 GSENDALYEYLRQSVGNEAEIWLGLNDMAA 90 

91 EGTWVDMTGARIAYKNWETEITAQPDGGKT120 

121 ENCAVLSGAANGKWFDKRCRDQLPYICQFG150 

1 5 1 I V * 1 52 



Fig. 8 
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Fig. 9 
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H6FXtrlpb-UB fusion protein 



1 MGSHHHHHHGSIQGRSPGTEPPTQKPKKIV 30 

31 NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 

61 EQQALQTGSQIPVKTLTGKTITLEVEPSDT 90 

91 lENVKAKIQDKEGIPPDQQRIiIFAGKQLEDl20 

121 GRTLSDYNIQKESTLHLVLRLRGGS*145 
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H6FXscFv(CEA6Hripb fusion protein 

IMGSHHHHHHGSIQGRSQVQLQQSGAEVKKP 30 
31GSSVKVSCKASGGTFSNSPINWLRQAPGQG 60 
61LEWMGSIIPSFGTANYAQKFQGRLTITADE 90 
9ISTSTAYMELSSI1RSEDTAVYYCAGRSHNYE 120 
121 LYYYYMDVWGQGTMVTVSSGGGGSGGGGSG 150 
151 GGGSDIQMTQSPSTLSASIGDRVTITCRAS 180 
181 EGIYHWLAWYQQK P G KAPKLLIYKASSLAS 210 
211 GAPSRFSGSGSGTDFTLTISSLQPDDFATY 240 
241 YCQQYSNYPLTFGGGTKLEIKRAAAEQKLI 270 
271 SEEDLNGAGTEPPTQKPKKIVNAKKDVVNT 300 
301 KMFEELKSRLDTLAQEVALLKEQQALQTGS 330 
331 * 



Fig. 16 
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H6FXtrlpb-scFv(CEA6) fusion protein 



1 


M 


G 


SHHHHHHGSIQGRS 


P G T E P PTQKPKKIV 


30 


31 


N 


A 


KKDVVNTKMFEELK 


SRLDTLAQEVALLK 


60 


61 


EQQALQTGSQVQLQQS 


GAEVKKPGSSVKVS 


90 


91 


C 


K 


ASGGTFSNSPINWIi 


RQAPGQGLEWMGSI 


120 


121 


I 


P 


SFGTANYAQKFQGR 


LTITADESTSTAYM 


150 


151 


E 


L 


SSLRSEDTAVYYCA 


GRSHNYELYYYYMD 


180 


181 


V 


W 


GQGTMVTVSSGGGG 


SGGGGSGGGGSDIQ 


210 


211 


M 


T 


QSPSTLSASIGDRV 


TITCRASEGIYHWL 


240 


241 


A 


W 


YQQKPGKAPKLLIY 


KASSLASGAPSRFS 


270 


271 


G 


S 


GSGTDFTLTISSLQ 


PDDFATYYCQQYSN 


300 


301 


y 


P 


LTFGGGTKLEIKRA 


AAEQKLISEEDLNG 


330 



331 A * 



Fig. 18 
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H6FXscFv(CEA6)-tripb-scFv(CEA6) fusion protein 

IMGSHHHHHHGSIQGRSQVQLQQSGAEVKKP 30 
31GSSVKVSCKASGGTFSNSPINWLRQAPGQ6 60 
61LEWMGSIIPSFGTANYAQKFQGRLTITADE 90 
QlSTSTAYMEIiSSLRSEDTAVYYCAGRSHNYE 120 
121 LYYYYMDVWGQGTMVTVSSGGGGSGGGGSG 150 
151 GGGSDIQMTQSPSTLSASIGDRVTITCRAS 180 
181 EGIYHWLAWYQQKPGKAPKLLIYKASSLAS 210 
211 GAPSRFSGSGSGTDFTLTISSIiQPDDFATY 240 
241 YCQQYSNYPLTFGGGTKLEIKRAAAEQKLI 270 
271 SEEDLNGAGTEPPTQKPKKIVNAKKDVVNT 300 
301 KMFEELKSRIiDTLAQEVALLKEQQALQTGS 330 
331QVQLQQSGAEVKKPGSSVKVSCKASGGTFS360 
361 NSPINWLRQAPGQ6LEWMGSIIPSFGTANY 390 
391 AQKFQGRLTITADESTSTAYMELSSLRSED 420 
421 TAVYYCAGRSHNYELYYYYMDVWGQGTMVT 450 
451 VSSGGGGSGGGGSGGGGSDIQMTQSPSTLS 480 
481 ASIGDRVTITCRASEGIYHWLAWYQQKPGK 510 
511 APKLLIYKASSLASGAPSRPSGSGSGTDFT 540 
541 LTISSLQPDDFATYYCQQYSNYPLTFGGGT 570 
571KLEIKRAAAEQKLISEEDLNGA* 592 
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a diagnostic method practised on the human/animal body, the search has been 
carried out and based on the alleged effects of the compound/composition. 
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